I'm looking for a language translation API/solution that would fit my use case.
My use case is the following:
- The data is 1 GB of free unstructured text written mostly in the world's common languages (French, Spanish, German, Russian, Korean). The language of each piece of text is known.
- We can assume the text is grammatically correct and consists of complete sentences, but contains some uncommon words such as chemical compound names.
- The text has to be translated to English.
- The solution must be at least 10x cheaper than Google Translate which charges $20 per 1M characters.
- I would be willing to trade some of the Google's quality for cost-effectiveness. Google, Yahoo, Microsoft, Yandex, Online-Translator.com are all good enough, just too expensive.
I've got a 16 CPU machine at my disposal so offline translation is an option too.
Any suggestions?