Questions tagged [icu]

The International Components for Unicode (ICU) is a set of cross-platform, open source C/C++ and Java libraries for Unicode and globalization support. It is developed by the Unicode Consortium. Use this tag when you have a question about using ICU. Be sure to include the appropriate language tag and the version of ICU that you are using.

The International Components for Unicode supports C/C++ (ICU4C) and Java (ICU4J), and is available under an open source license. See the ICU-TC home page for documentation and licensing details.

771 questions
0
votes
1 answer

How to convert estimated time to string with ICU library

How to convert estimated time to string with ICU library?
angel
  • 1
  • 1
0
votes
1 answer

How to use ICU without setting ICU_DATA directory

I'm trying to use ICU4C on Windows. I copied the 2 DLLs and an ICU data file into the executable directory. According to what I read in the documentation, I should not need to set a specific data directory (http://userguide.icu-project.org/icudata),…
Tobias Langner
  • 10,634
  • 6
  • 46
  • 76
0
votes
2 answers

How to get an ICU converter from a Windows code page number

I need a platform-independent way to convert a string encoded with a Windows code page, of which I know only the number, to a Unicode string and back. I thought that icu4c might be able to do that but I did not find the…
Tobias Langner
  • 10,634
  • 6
  • 46
  • 76
0
votes
1 answer

Can Code page and locale differ?

On Windows 7 (64 bit) I set Japan as the locale. At the command prompt the following is reported: LC_ALL: English_United States.1252; LC_CTYPE: English_United States.1252; chcp command: Active Code Page: 932 (which is Japanese). My question is while…
user1989504
  • 133
  • 1
  • 13
0
votes
0 answers

7 bit ascii with icu4j

I'm trying to use the icu4j library. I want to receive a text, determine its encoding, and then convert it into 7-bit ASCII format, i.e. to use &#number instead of a symbol from the extended ASCII table (whose value is greater than 128). For detecting an…
user2957954
  • 1,221
  • 2
  • 18
  • 39
0
votes
1 answer

How to encode code point list into UTF8 chunk using libICU?

Due to program requirements (fast access to individual characters), I am using uint32_t[] for characters. It simply stores code points, not UTF-32 code units, because I don't think a UTF-32 code unit and a Unicode code point are the same thing, so I have to…
eonil
  • 83,476
  • 81
  • 317
  • 516
0
votes
1 answer

Do PHP 5.5's intl extension and the PECL intl package originate from the same source code?

Recently I had to move from the PHP 5.5's native intl extension to the intl PECL package for the purpose of keeping intl/ICU up to date. But now I wonder if the behavior of intl code in my PHP code base has remained completely unchanged. My code…
Desmond Hume
  • 8,037
  • 14
  • 65
  • 112
0
votes
1 answer

Reversing order of udata_setCommonData

The documentation for ICU states that Starting with ICU 4.4, it is possible to set several data packages, one per call to this function. udata_open() will look for data in the multiple data packages in the order in which they were…
tofutim
  • 22,664
  • 20
  • 87
  • 148
0
votes
1 answer

Implementation of realtime break iterator

I'm interested in modifying the break iterator data (zh) as my program is running as the user adds new words. This means that the data cannot be originally packaged and must be generated as I go. Can I use something like udata_setAppData or…
tofutim
  • 22,664
  • 20
  • 87
  • 148
0
votes
2 answers

Problems building ICU

I'm trying to build ICU on Solaris SPARC 64 and it's giving me this error when calling make: * No rule to make target `uconvmsg/libuconvmsg.a(uconvmsg/uconvmsg_dat.ao', needed by `uconvmsg/libuconvmsg.a'. Stop. There seems to be a bug with make 3.8.2…
Dao Lam
  • 2,837
  • 11
  • 38
  • 44
0
votes
1 answer

Regex for finding strings with reasonable "Year/YEAR" pattern

I am searching for a very specific regular expression that will help me find strings that meet the following rules: The string must contain the keyword 'Year' or 'YEAR'. The keyword needs to be separated from other text by non-letters OR the keyword…
lbrendanl
  • 2,626
  • 4
  • 33
  • 54
0
votes
1 answer

Browser/parser for icudt48l.dat on Android?

New releases of one mail program fail to run on my Android phone. The program's authors ignored the bug report for half a year, then closed it as invalid because "there maybe (sic!) some (sic!) method is missing in Charset class on your phone, ask your vendor…
Arioch 'The
  • 15,799
  • 35
  • 62
0
votes
1 answer

How can I add more locales to Symfony/Icu 1.0.x?

Symfony2 Form localization now depends on the Symfony/Intl and Symfony/Icu components. If the server you run your application on has an ICU lib version lower than 4.0, you must use the Symfony/Icu 1.0.x component, which emulates the ICU lib. The problem is that…
Laurynas Mališauskas
  • 1,909
  • 1
  • 19
  • 34
0
votes
4 answers

Iterate over character blocks in UTF-8

My task is to iterate over all the UTF-8 character codes corresponding to a given language (locale). I suppose it's not that easy and I have to iterate over character blocks (like the whole of Cyrillic for "ru_RU", for example). I can find characters…
Alexander
  • 1,299
  • 2
  • 12
  • 32
0
votes
1 answer

Can parameters in MessageFormat strings be optional?

I am using the MessageFormatter class to format translated strings to display to the end user. The catch is that I also need to insert some HTML markup into the final result. For example, the string might just be Peter liked a photo, however, I need…
F21
  • 32,163
  • 26
  • 99
  • 170