#11 2013-08-02 03:37

eR@SeR
Senior Member
From: Земун, Србија
Registered: 2008-01-23
Posts: 356

Re: Problem with "Ñ" Caracter

den4b wrote:

In the example Pequeñas.mp3 we actually have Pequen~as.mp3 in binary.

SafetyCar wrote:

I have tested now the insert rule, the compound ñ does also alter the count of character. Being possible to insert something between both separating them.

Confirmed!

Also:

1) Delete: Delete from Position 1 until Count reaches 3 (right-to-left) (skip extension)

"Shows" those two characters combined into one i.e. removes ~ from filename.

then

1) Remove: Remove last "n" (skip extension)

Prince Royce - Las Cosas Pequeñas.mp3   --->  Prince Royce - Las Cosas Pequẽas.mp3

1) Strip: Strip characters "ABCDEFGHIJKLMNOPQRSTUVWXYZ" (skip extension)

Prince Royce - Las Cosas Pequeñas.mp3   --->  -   ̃.mp3

But...:

1) Remove: Remove all "~" (skip extension)

...remains unchanged. Same when is putted  ̃ character hmm

den4b wrote:

Another option is to perform Unicode Normalization possibly as an option of Cleanup rule, prior to using any other rule where those characters could cause issues.

How can be done? Using Prepare for SharePoint (nothing happens)?

den4b wrote:

You have just opened a can of worms!

Yep, looks like that roll


TRUTH, FREEDOM, JUSTICE and FATHERLAND are the highest morale values which human is born, lives and dies for!

Offline

#12 2013-08-02 08:21

SafetyCar
Senior Member
Registered: 2008-04-28
Posts: 446
Website

Re: Problem with "Ñ" Caracter

I would like to point out that the normalization function would be also useful also on PascalScprit, where there are already others like WideToAnsi and the opposite.


If this software has helped you, consider getting your pro version. :)

Offline

#13 2013-08-11 13:48

den4b
Administrator
From: den4b.com
Registered: 2006-04-06
Posts: 3,479

Re: Problem with "Ñ" Caracter

I have implemented special handling for combining characters (e.g. COMBINING TILDE U+0303) within Case rule when performing capitalization of words. Letters in front of such characters will no longer be capitalized.

Unicode normalization feature is noted for future development, together with SafetyCar's comment about adding it to PascalScript.

Offline

Board footer

Powered by FluxBB