Questions tagged [thai]

The national language of Thailand whose unique script presents various challenges when processing or rendering text.

The national language of Thailand whose unique script presents various challenges when processing or rendering text:

  • Non-spacing / combining marks for some vowels and tone marks
  • Some tone marks stack vertically
  • Some vowel letters are not rendered visually in the same order they are stored logically as data
  • Spaces are not used between words
106 questions
1
vote
1 answer

Postgresql Encoding Issue Using tsearch With Thai Text After Conversion from MariaDB using Pgloader

I am trying to convert a MySQL UTF8mb4 database which contains both Thai and English to Postgresql. This appears to go well until I try and add tsearch. Let me outline the steps taken. Install this Thai parser…
gordonbanderson
  • 221
  • 2
  • 5
1
vote
0 answers

change character ก to Ď during xls2csv -d utf-8 /source.xls >desination.csv

When I convert an XLS file to CSV using xls2csv there is a problem that the character 'ก' is converted to 'Ď'. My command is: xls2csv -d utf-8 /test.xls>/test.csv Other characters of Thai and other languages work normally.
user3559721
  • 134
  • 8
1
vote
2 answers

ICU combining Thai vowel signs and when to ignore

I'm processing Thai keyboard input. Some of the keys are vowel signs and only allowed when combined with certain preceding characters. Here 0x0E33 is the vowel sign For example 0x0E1C + 0x0E33 is valid but 0x0E44 + 0x0E33 is not valid and the 0x0E33…
user2233208
1
vote
1 answer

Convert \xc3\xd8\xe8\xa7\xc3\xb4\xd to human readable format

I am having trouble converting '\xc3\xd8\xe8\xa7\xc3\xb4\xd' (which is a Thai text) to a readable format. I get this value from a smart card, and it basically was working for Windows but not in Linux. If I print in my Python console, I get: ����ô I…
Sijan Shrestha
  • 2,136
  • 7
  • 26
  • 52
1
vote
1 answer

Get the correct graphical representation of Thai combination characters

I'm trying to write a bitmap font that can render Thai (amongst others). I've got the character set but a lot of the characters are "combining character" codepoints. I'm using Graphics.DrawString to get a bitmap of the characters. This worked fine…
Luc Bloom
  • 1,120
  • 12
  • 18
1
vote
1 answer

urllib2 doesn't get same html string as normal browsers with same agents (encoding error?)

I m trying to get the page from this site http://www.francais-thai.com/dicoweb/fran/00012.htm but in python (there is thai text in the page) here is the code I tried: (it should download the page) # -*- coding: utf-8 -*- import urllib2 agents =…
Arnaud Aliès
  • 1,079
  • 13
  • 26
1
vote
0 answers

Transliterate Thai to Latin using RTGS in ICU Project

I am using the ICU Project to perform some transliterations, but I am finding some issues while transliterating from Thai to Latin. I would need to use the RTGS (https://en.wikipedia.org/wiki/Royal_Thai_General_System_of_Transcription) standard, but…
aperezra
  • 165
  • 2
  • 7
1
vote
1 answer

Thai string manipulation issues caused by incorrect string length

I am trying to highlight some substring in a Thai text: high = high.Insert(myString.Index + myString.Length + "" + currentLength, ""); The issue is, that the myString string contains a special Thai character ("เงินฝาก"). The given string…
1
vote
1 answer

how to convert numeric to thai language word in java

I used following code but it not work import java.text.NumberFormat; import java.util.Locale; public class NumToword { public static void main(String str[]){ String outputString = new String(); Locale[] thaiLocale…
Brijesh
  • 148
  • 2
  • 16
1
vote
2 answers

Unable to return query Thai data

I have a table with columns that contain both thai and english text data. NVARCHAR(255). In SSMS I can query the table and return all the rows easy enough. But if I then query specifically for one of the Thai results it returns no rows. SELECT TOP…
Hammertime
  • 59
  • 1
  • 11
1
vote
1 answer

SQL WHERE clause does not match on columns with Latin and Thai characters

I'm cleaning up a SQL Server 2012 database which contains some nvarchar columns containing mixed character set data (Latin and Thai script). For example: As I have a number of columns with the same TariffDescription text, I want to apply a…
Mark Micallef
  • 2,643
  • 7
  • 28
  • 36
1
vote
1 answer

Ionic app does not display lao/thai characters on Android device

I am developing an application that uses a webfont to display thai/lao characters. To do so I am defining a custom font-face as follows: @font-face { font-family: 'SaysetthaWeb'; src: url('./saysettha-web.eot') format('embedded-opentype'); …
ceyquem
  • 2,079
  • 1
  • 18
  • 39
1
vote
3 answers

NSURL with Thai language

I have problem with NSURL and Thailand language for example "http://www.xyp.com?var=ไทย" it does't work but if you use "http://www.xyp.com?var=Thai" it ok. I don't know how to solve this problem anyone please help me.
mr.octobor
  • 191
  • 2
  • 4
  • 18
1
vote
0 answers

Thai combining char

Recently I'm dealing with a issue with open Thai named file in folder in development. Usually we do this by: Java: ProcessBuilder with "explorer /n,/select,/folder/อักษรไทย.jpg" Windows C API: ShellExecuteW with L"explorer" L"open"…
Luly
  • 11
  • 2
1
vote
1 answer

Finding non-spacing characters in Objective-C

How do I check if a character in my NSString is a non-spacing character? For example: NSString *string3 = @"เบียร์เหล้า"; // 12345678 <- only take up 8 spaces The NSString length is 11, but actually the space that it occupies is…
user774150
  • 962
  • 2
  • 10
  • 24