Questions tagged [thai]

The national language of Thailand whose unique script presents various challenges when processing or rendering text.

The national language of Thailand whose unique script presents various challenges when processing or rendering text:

  • Non-spacing / combining marks for some vowels and tone marks
  • Some tone marks stack vertically
  • Some vowel letters are not rendered visually in the same order they are stored logically as data
  • Spaces are not used between words
106 questions
3
votes
2 answers

Match a Thai Script character in Java

Over last two hours I have a lot of sexy time with Thai Script strings that slipped in my database. They collate mysteriously, mutate when output, do not have natural order and are a disaster. I want to just ignore any strings with Thai Script…
alamar
  • 18,729
  • 4
  • 64
  • 97
3
votes
1 answer

OpenNLP gives error when using Thai model

I have tried to follow the advice from here, but I got this error: C:\OpenNLP_models\tool\apache-opennlp-1.5.3-bin\apache-opennlp-1.5.3\bin>opennlp TokenizerME C:\OpenNLP_models\tool\apache-opennlp-1.5.3-bin\apache-opennlp-1.5.3\bin\thai.tok.bin <…
Music
  • 133
  • 1
  • 1
  • 7
3
votes
3 answers

Set up a Database with NLS_CALENDAR parameter as 'Thai Buddha'

I want to set up a database (Oraclev 12.1.0.2) with NLS_CALENDAR parameter as 'Thai Buddha', at the same time I need to keep character set as AL32UTF8. When I do this at setting up NLS_CALENDAR parameter does not get changed at DB level. SELECT…
Dyn
  • 71
  • 8
3
votes
1 answer

Rendering thai characters with tags breaks on Apple devices?

I'm trying to render Thai characters with tags wrapped around them: So for instance: (อัน) ัน I want to display the character between the italics in a different color. Works fine on my Windows machine and my Android phone. But on iPhone and…
Rob
  • 10,851
  • 21
  • 69
  • 109
3
votes
1 answer

Java : In console Thai text is printing as some strange character

When I am printing thai character in console it is showing some strange character. public static void main(String[] args) throws Exception{ byte[] bytes = "ฝ่ายขาย".getBytes("TIS-620"); String str = new String(bytes); …
Arvind
  • 1,207
  • 6
  • 27
  • 55
3
votes
1 answer

Manipulating Thai Characters in PHP

I'm struggling getting Thai characters and PHP working together. This is what I'd like to do: But instead of giving me the first character of $string (ท), I just get…
2
votes
1 answer

Find word boundaries in Southeast Asian Languages (Thai, Khmer, Lao, Myanmar)

Many languages have spaces between words, so it's easy to know when to wrap to a new line. For example, in english, any space represents an opportunity to wrap lines. However, languages like Thai have wrapping between words, but no spaces to tell…
Dennis L
  • 1,713
  • 14
  • 21
2
votes
1 answer

MonoDevelop not support display Thai font

I'm now learning iPhone development with Monotouch and use Mono Develop for IDE. Everything works fine and I'm going to buy a license for MonoTouch. However, the IDE can not display Thai text correctly. It just display [] that is difficult for me to…
embarus
  • 815
  • 3
  • 14
  • 28
2
votes
2 answers

Parse date in specific format using Thai Calendar in Android

For an Android app, I need to parse date in Thai local format (as like, year 2019 should be returned as 2562). I do not understand how can I do so. Currently using SimpleDateFormatter to parse the date in default local format. fun…
Sudip Podder
  • 830
  • 11
  • 25
2
votes
0 answers

Is Google AMP supported in Thailand / for Thai language?

During a visit to Google Headquarter in Bangkok, we learned that Thailand was the only country not supporting AMP. (more specifically Thai language as English pages works just fine). As this meeting was last March, I was just wondering if it's still…
2
votes
3 answers

Thai character not rendered correctly in PDF

My app should be able to output a PDF file containing the user guide in several supported languages. (I'm using pdfkit) I had some troubles finding a suitable font for Thai: some so-called Thai supported languages (included Noto Thai from Google)…
2
votes
1 answer

Output Arabic/Thai text in Excel file using Photoshop JavaScript

I have a script for photoshop which outputs the name of a text layer and layer content into Excel CSV file. It works fine if the text is english but if the text is Arabic/Thai it was displayed like this "??????". How can I display these texts…
Divine
  • 21
  • 2
2
votes
0 answers

How does the Chrome browser know where the word boundaries are in Thai text?

Text in Thai is written with no spaces between the words. Instead, spaces indicate a break, like a comma or the end of a sentence. For example, the string พูดไปสองไพเบี้ย นิ่งเสียตำลึงทอง means "Speech is cheap; silence is golden" with the space…
James Newton
  • 6,623
  • 8
  • 49
  • 113
2
votes
0 answers

Dealing with CountVectorizer in other language

I used CountVectorizer from Scikit-learn to count the occurrence of Thai word which already tokenized and then tried to vectorizing the text in Python 3 Code: #-*-coding: utf-8 -*- w_vector = ["การ", "ซ้อม"] v = CountVectorizer(vocabulary=w_vector,…
boated_tw
  • 414
  • 1
  • 9
  • 19
2
votes
2 answers

How do I properly render Thai font in libGDX?

I'm making an app to help me remember some Thai words I'm learning. However I can't get text to render correctly. I used this example to create a basic scene. This is what I have so far. public class ThaiWords extends ApplicationAdapter { …
Johnny
  • 35
  • 4