You are not logged in.
Pages: 1
Dear denis firstly i should thank you for these really amazing tool
i tried butch rename my .html files in to ":HTML_Title:" but these is failed because of invalid file names here is my title MÜLK here is how i see by preview mode MÃœLK
thx for ur efforts
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<title>MÜLK</title>
Last edited by ademmm (2009-02-24 11:58)
Offline
ReNamer extracts :HTML_Title: tag as plain ANSI text without using the specified charset, which in your case is UTF-8.
To fix this in general, I would have to extract the specified charset and convert it to Unicode. This is a bit of a problem, because there are hundreds of different charsets.
But for UTF-8, you can use PascalScript to quickly fix this. Try the code below, for the starter:
var
Title: String;
begin
Title := CalculateMetaTag(FilePath, 'HTML_Title');
FileName := UTF8Decode(Title) + ' ' + FileName;
end.
Offline
thx a lot that's worked for me but some html title have ?/| not supported characters i get the some message with yellow aler icon.
its will be magnificent if application self ignore these types of characters
for example
i get the the message?
alert - invalid filenames
i get the message
application ignore ? character
Offline
I don't know how to do that in pascal script but you could just add a strip rule, marking User defined, and there you put the symbols you don't want.
If this software has helped you, consider getting your pro version. :)
Offline
If you know the original character set (which may or may not be something easily determined.... it might be a case of trial-and-error if you really don't know), you could probably use what I have of my Iconv script to handle the character set conversion to Unicode. In which case, you set up the script to pull out the metadata information, then feed that into iconv, and hope that the magic happens.
Using Iconv is more complicated, but applies if the original text encoding is not Unicode.... ie, since the topic of this thread is talking Turkish, that the original text encoding could be ISO 8859-3, ISO 8859-9, Windows-1254, etc...
Offline
Pages: 1