Questions tagged [icu]

The International Components for Unicode is a set of cross-platform open source C/C++ and Java libraries for Unicode and globalization support. It is developed by the Unicode Consortium. Use this tag when you have a question about using the ICU. Be sure to include the appropriate language tag and the version of ICU that you are using.

The International Components for Unicode supports C/C++ (ICU4C) and Java (ICU4J), and is available under an open source license. See the ICU-TC home page for documentation and licensing details.

It uses .

771 questions
0
votes
1 answer

utext_openUTF8 return error for string "mod-12k";

In following small program I cant figure out why the utext_openUTF8 return with error.. int utf8_is_it_model3(std::string input_string) { int error_number=0; UText *ut = NULL; UErrorCode status; //char *pEnd; const char * c2 =…
user1675224
  • 213
  • 2
  • 12
0
votes
2 answers

Set before ERA dates in ICU4J in JAVA

I'm using ICU4J calendar library for converting dates between calendar systems (Gregorian, Japanese, Chinese, Jalali, Islamic,...). I use it with normal dates (after ERA) like this : Calendar gc = new GregorianCalendar(); gc.setTime(dateOBJ); Is…
Ariyan
  • 14,760
  • 31
  • 112
  • 175
0
votes
1 answer

How to detect charset encoding with php icu extensions?

ICU has functions to detect charset encodings, here and PHP also has ICU extension, here but how to use php icu extension to do the charset detection? I can not find anything related to this part on php's doc
Mickey Shine
  • 12,187
  • 25
  • 96
  • 148
0
votes
1 answer

ICU: Create a case-insensitive collation

I am creating a custom build of SQLite3 and I want to implement case-insensitive text columns. Is it possible to create a case-insensitive collation?
0
votes
2 answers

perl libSqliteIcu.so collate icu

The libSqliteIcu works fine in sqlite3 terminal with: sqlite3 ... .load './libSqliteIcu.so' SELECT icu_load_collation('pl_PL', 'POLISH'); SELECT DISTINCT miasto FROM tab ORDER BY miasto COLLATE POLISH; How to code the sequence above into perl?
zeta0
  • 101
  • 3
0
votes
1 answer

ICU implementation code for BreakIterators

I was looking to modify the ICU source code, but it's quite impenetrable to me and I was unable to find the piece that I was looking for. I was hoping to find the code (and hopefully some explaining comments/documentation) on the BreakIterators…
Puppy
  • 144,682
  • 38
  • 256
  • 465
0
votes
2 answers

Detecting word boundaries and the language for each word from UTF8 buffer

I'm developing a custom search engine and I need to pass each word to the appropriate language specific stemmer. I've recently discovered Compact Language Detector (CLD)…
0
votes
1 answer

icu4 anc freetype

I want to know what is the difference between ICU4 and freetype. Many application or libs uses either of these two like webkit. what is the relation between these two? which one is better.
dead programmer
  • 4,223
  • 9
  • 46
  • 77
-1
votes
1 answer

different results of regex matching among ICU library, Rust and PCRE(https://regexr.com/)

here is the pattern I used : "\w+|[^\w\s]+" when I match string "abc.efg" and "戦場のヴァルキュリア3" using PCRE in https://regexr.com/, it give me results like this: "abc" "." "efg" => 3 parts "戦場のヴァルキュリア" "3" => 2 parts that looks like right. But when I…
Damons
  • 153
  • 1
  • 1
  • 7
-1
votes
1 answer

'IM003', '[IM003] system error 126: The specified module could not be found (When using ICU library)

Building C/C++ code in windows for the first time in my career, I need to make a change in a 'legacy' (odbc) code to use a new (ICU, static) library. I made code changes via Vidual studio (2015) and added include and lib dependncies via project…
rahman
  • 4,820
  • 16
  • 52
  • 86
-1
votes
2 answers

[Docker][PHP] NumberFormatter::formatCurrency incompatible between 7.4.29 and 7.4.28

(Same as https://github.com/docker-library/php/issues/1301 ) The results of running the following script were different between php:7.4.29-fpm-alpine and php:7.4.28-fpm-alpine.
sogaoh
  • 34
  • 3
-1
votes
1 answer

icu::SimpleDateFormat use of undefined type compiler error

I cannot figure out why the following code (ICU example) will not compile: void test() { icu::Locale locale = icu::Locale("en"); UErrorCode status = U_ZERO_ERROR; DateTimePatternGenerator* generator = …
Caroline Beltran
  • 888
  • 2
  • 9
  • 22
-1
votes
1 answer

How to tweak ICU's UnicodeString::caseCompare (or get the same effect)

I'm not super familiar with how case-folding/case-insensitive comparisons work, and ICU in general. Right now, we have some methods that wrap various overloads of UnicodeString::caseCompareand I want to change them to do something…
Bwmat
  • 4,314
  • 3
  • 27
  • 42
-1
votes
1 answer

Convert .po file into ICU4C .txt file

I've tried to create an ICU4C file from a gettext .po file with a sed script like this: /^#/ d /* delete comments */ :a;/"$/{N;s/"\n"//;ba} /* merge quoted lines in loop */ /^msgid /s/msgid (.*)/\1/ /*…
DanMan
  • 11,323
  • 4
  • 40
  • 61
-1
votes
1 answer

How to design a regex with a single OR condition that checks the beginning and end of a pattern?

I'd like to create a regex for a passcode where users signify their number is a passcode with words and a double colon before the code or a hash after the word. So I'd like both of the following to be accepted: code: 1234567 123567# However as '#'…
Declan McKenna
  • 4,321
  • 6
  • 54
  • 72
1 2 3
51
52