How do you convert an RTF string to plain text in Java? The obvious answer is to use Swing's RTFEditorKit, and that seems to be the common answer around the Internet. However the write method that claims to return plain text isn't actually implemented... it's hard-coded to just throw an IOException in Java6.
Asked
Active
Viewed 3.1k times
4 Answers
21
I use Swing's RTFEditorKit in Java 6 like this:
RTFEditorKit rtfParser = new RTFEditorKit();
Document document = rtfParser.createDefaultDocument();
rtfParser.read(new ByteArrayInputStream(rtfBytes), document, 0);
String text = document.getText(0, document.getLength());
and thats working.

morja
- 8,297
- 2
- 39
- 59
-
Got it working using this solution, though it didn't work at first either. It turns out that my input data was invalid, and the conversion was failing silently and returning an empty string. – edm3 Apr 29 '11 at 18:13
-
1It works for me, but for some reason the text comes out with dropped characters. – george_h Dec 19 '13 at 08:41
-
It works fine in WIndows platform, but for *nix platform, it has dependecy with X11 window server. – Valijon Feb 16 '16 at 14:42
-
it doesn't work with "\line" code – Ekaterina Ivanova iceja.net Jun 12 '22 at 13:18
2
You might consider RTF Parser Kit as a lightweight alternative to the Swing RTFEditorKit. The line below shows plain text extraction from an RTF file. The RTF file is read from the input stream, the extracted text is written to the output stream.
new StreamTextConverter().convert(new RtfStreamSource(inputStream), outputStream, "UTF-8");
(full disclosure: I'm the author of RTF Parser Kit)

Jon Iles
- 2,519
- 1
- 20
- 30
-
Nice job! But "Kotlin: Unresolved reference: MyRtfListener" – Ekaterina Ivanova iceja.net Jun 13 '22 at 13:53
-
Does it mean that I must implement IRtfListener myself? – Ekaterina Ivanova iceja.net Jun 13 '22 at 14:09
-
@CatherineIvanova the one line example above will extract plain text for you... no need to implement the listener. I think your reference to `MyRtfListener` comes from the RTF Parser Kit readme which does illustrate the case where you'd provide your own listener. – Jon Iles Jun 13 '22 at 15:54
-
@JonIles thanks for the parser project! Can you see also this question https://stackoverflow.com/questions/74111741/get-files-from-an-rtl-input-in-java ? – yaylitzis Oct 18 '22 at 13:23
0
Here is the full code to parse & write RTF as a plain text
import java.io.FileInputStream;
import java.io.FileWriter;
import java.io.IOException;
import java.io.InputStreamReader;
import javax.swing.text.BadLocationException;
import javax.swing.text.Document;
import javax.swing.text.rtf.RTFEditorKit;
public class rtfToJson {
public static void main(String[] args)throws IOException, BadLocationException {
// TODO Auto-generated method stub
RTFEditorKit rtf = new RTFEditorKit();
Document doc = rtf.createDefaultDocument();
FileInputStream fis = new FileInputStream("C:\\SampleINCData.rtf");
InputStreamReader i =new InputStreamReader(fis,"UTF-8");
rtf.read(i,doc,0);
// System.out.println(doc.getText(0,doc.getLength()));
String doc1 = doc.getText(0,doc.getLength());
try{
FileWriter fw=new FileWriter("B:\\Sample INC Data.txt");
fw.write(doc1);
fw.close();
}catch(Exception e)
{
System.out.println(e);
}
System.out.println("Success...");
}
}

hitesh007
- 51
- 5