Questions tagged [codepoint]

A code point is any of the numeric values that make up the Unicode codespace, which ranges from U+0000 to U+10FFFF.

A code point may represent a character or have another meaning; the standard defines seven fundamental classes of code points: Graphic, Format, Control, Private-Use, Surrogate, Noncharacter, and Reserved.
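
As a rough illustration of the seven classes, here is a minimal Python sketch that lists the code points of a string and maps each one to a class. The helper names `code_points` and `basic_type` are hypothetical, and the mapping from General_Category values to classes is an approximation based on the standard `unicodedata` module, not an official API.

```python
import unicodedata


def code_points(text):
    """Return the code points of `text` as a list of integers."""
    return [ord(ch) for ch in text]


def basic_type(cp):
    """Approximate the fundamental class of code point `cp` (hypothetical helper)."""
    if not 0 <= cp <= 0x10FFFF:
        raise ValueError("value is outside the Unicode codespace")
    # The 66 noncharacters: U+FDD0..U+FDEF plus the last two code points of each plane.
    if 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE:
        return "Noncharacter"
    category = unicodedata.category(chr(cp))
    if category == "Cc":
        return "Control"
    if category in ("Cf", "Zl", "Zp"):
        return "Format"
    if category == "Co":
        return "Private-Use"
    if category == "Cs":
        return "Surrogate"
    if category == "Cn":
        # Unassigned and not a noncharacter (checked above).
        return "Reserved"
    # Letters, marks, numbers, punctuation, symbols, and space separators.
    return "Graphic"


if __name__ == "__main__":
    for cp in code_points("A\u200d\U0001F1FA"):
        print(f"U+{cp:04X}", basic_type(cp))
```

Which code points count as Reserved depends on the Unicode version bundled with the Python interpreter, so results for recently assigned characters may differ between installations.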

Related tags

unicode
UTFs: utf-8, utf-16, utf-32

116 questions

1 answer

How to sort strings in JavaScript by code point values?

I need to sort an array of strings, where elements are compared lexicographically as sequences of code point values, so that, for example, "Z" < "a" < "\udabc" < "�" < "". Is there a more efficient way of comparing strings, other than manually…

javascript unicode comparison codepoint

asked Nov 27 '21 at 13:03

abacabadabacaba

2,662
1
13
18

0 answers

Get first printable character from a string

This might seem like an already answered question, but I couldn't find it anywhere. How do I get the first printable character in Java? For example abcd //should return "a" - The first printable char is of 1 bytes //should return "" - The…

java character-encoding special-characters emoji codepoint

asked Oct 04 '21 at 07:15

Pankaj Singhal

15,283
9
47
86

1 answer

Reading Glyphs from a String using codePointAt(i) or Charseterset issue

I created a text editor for JavaFX which paints the text on a Canvas, glyph by glyph. I use String.codePointAt(i) to correctly load the glyphs. Somehow the first glyph is a strange one, I don't know why. The file was loaded using Charset UTF-16…

java string encoding character-encoding codepoint

asked Sep 21 '21 at 15:21

DbSchema

1 answer

In java what's different between Character.isBmpCodePoint and Character.isValidCodePoint

In Java, what's the difference between Character.isBmpCodePoint and Character.isValidCodePoint? I mean, I know 0x10FFFF and 0xFFFF, but what does it imply? Which should I use?

java unicode codepoint

asked Sep 13 '21 at 14:18

FredSuvn

1,869
2
12
19

0 answers

Why are codepoints in the block CJK UNIFIED IDEOGRAPHS EXTENSION B not named according to the group pattern

In the Java standard library, Character.getName(0x2000A) returns "CJK UNIFIED IDEOGRAPHS EXTENSION B 2000A" (in java 11, 16 and 17, using unicode version 10 and unicode version 13), while I expected "CJK UNIFIED IDEOGRAPHS-2000A" The result…

java unicode character codepoint

asked Sep 01 '21 at 16:42

Martijn

11,964
12
50
96

1 answer

Build a token for Simplified Chinese Identifiers

I'm trying to build a token for Simplified Chinese Identifiers. Simplified Chinese Identifiers are defined in the specification as follows: simplified-Chinese-identifier = first-sChinese-identifier-character…

unicode cjk codepoint gb2312

asked Aug 13 '21 at 04:32

SoftTimur

5,630
38
140
292

1 answer

How do I reverse `String.fromCodePoint`, i.e. convert a string to an array of code points?

String.fromCodePoint(...[127482, 127480]) gives me a flag of the US (🇺🇸). How do I turn the flag back to [127482, 127480]?

javascript string unicode-string codepoint

asked May 21 '21 at 07:42

ppt

0 answers

why Unicode codepoint escape syntax doesn't work in php

I am confused about the Unicode codepoint escape syntax. Here is a demo: //this works fine echo "\u{1f602}"; // echoes 😂 //this doesn't work $var = '1f602'; echo '"\u{' . $var . '}"';// output \u1f602 After I searched, I found that eval will let it work…

php unicode escaping codepoint

asked Mar 14 '21 at 11:11

miracle00001

1 answer

Character Issues

Back Story I basically retrieve strings from a database. I alter some text or those strings. Then I upload those strings back to the database, replacing the original strings. After looking at the front-end that displays those strings, I noticed the…

java character-encoding codepoint

asked Sep 02 '20 at 17:45

SedJ601

12,173
3
41
59

2 answers

java string unicode code point convert to character

Ok, so I feel like this question was asked many times but I am not able to find an answer. I am comparing two different files that were generated by two different programs. Of course both programs are generating the files from the same db queries. I…

java string unicode codepoint

asked May 18 '11 at 22:11

Mohamed Nuur

5,536
6
39
55

1 answer

"Width" of character on screen

I'm using Ncurses to write a text editor. I would like to know if there is a way to determine how many different characters can be placed on screen, where each of the characters is encoded with UTF-8. For example when I get screen width of 10 and one…

c++ ncurses codepoint

asked Apr 08 '19 at 03:41

Mateusz Wojtczak

1,621
1
12
28

1 answer

Codepoint mismatch between Java and C

So, I'm having some problems with the following char – in a port of imgui to kotlin After digging the whole day into Charsets and encodings, I came down to my only hope: rely on the unicode codepoints. That char on the jvm "–"[0].toInt() // same as…

java c kotlin codepoint imgui

asked Apr 03 '19 at 16:34

elect

6,765
10
53
119

1 answer

How can I tell if a Unicode code point is one complete printable glyph(or grapheme cluster)?

Let's say there's a Unicode String object, and I want to print each Unicode character in that String one by one. In my simple test with very limited languages, I could successfully achieve this just assuming one code point is always the same as one…

java c# unicode glyph codepoint

asked Aug 23 '18 at 22:06

Jenix

2,996
2
29
58

1 answer

Efficient lookup table for unicode code points

Wondering how typically a unicode code point lookup table is done. That is, given a character such as a, return U+24B6, or vice versa. Wondering if there are any efficient tricks so that it doesn't just boil down to: a: U+24B6 b: ... c: ... Which…

optimization data-structures unicode encoding codepoint

asked Jun 21 '18 at 05:46

Lance

75,200
93
289
503

3 answers

What is the most idiomatic way to convert a string to characters in Erlang?

What is the most idiomatic way to convert this: "helloworld" to ["h","e","l","l","o","w","o","r","l","d"] in Erlang ?

string erlang codepoint

asked Mar 13 '18 at 05:44

Muhammad Lukman Low

8,177
11
44
54
