pgsql.hackers
[Top] [All Lists]

Re: proposal: UTF8 to_ascii function

Subject: Re: proposal: UTF8 to_ascii function
From: "Pavel Stehule"
Date: Tue, 12 Aug 2008 07:17:58 +0200
Newsgroups: pgsql.hackers
2008/8/11 Peter Eisentraut <peter_e@xxxxxxx>:
> On Monday 11 August 2008 16:23:29 Jan Urbański wrote:
>> Often clients want their searches to be
>> accented-or-language-specific letters insensitive. So searching for
>> 'łódź' returns 'lodz'. So the use case is there (in fact, the lack of
>> such facility made me consider not upgrading particular client to 8.3...).
>
> These are valid ideas, but then please design a new function that addresses
> your use case in a well-defined way, and don't overload questionable old
> interfaces for new purposes.
>
> In the Unicode standard you can find well-defined methods to decompose
> characters into diacritic marks, and then you could strip them off.  But this
> has nothing to do with ASCII or UTF8 or encodings.  Cyrillic characters can
> have diacritic marks as well, for example.
>

Hi Peter,

changes to_ascii from text to bytea is more bugfix than new feature
and should be done immediately. Correct conversions are related to
colum collation - is not necessary repeat same work and same code from
some unicode libs.

Regards
Pavel

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@xxxxxxxxxxxxxx)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
<Prev in Thread] Current Thread [Next in Thread>
Privacy Policy