Questions tagged [text-extraction]

Text extraction is the task of automatically extracting structured information from unstructured and/or semi-structured machine-readable documents (text).

Text extraction mechanisms vary with the context and the language used. Approaches range from regular expressions to classifiers to more complex custom models.
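
As a rough illustration of the simplest end of that range, here is a minimal Python sketch of regex-based extraction that pulls email-like strings out of free text. The pattern, the function name `extract_emails`, and the sample text are all assumptions made for this example; the pattern is deliberately loose and is not a full address validator.

```python
import re

# Deliberately simple pattern: local part, "@", domain, dot, top-level domain.
# It is an illustrative assumption, not a complete email address grammar.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(text: str) -> list[str]:
    """Return every substring of `text` that looks like an email address."""
    return EMAIL_RE.findall(text)


# Hypothetical sample text, made up for this example.
sample = "Name : Jane Doe\nE-mail : jane.doe@example.com\nGender : Female"
print(extract_emails(sample))  # ['jane.doe@example.com']
```

A regular expression like this works when the target has a predictable surface form; for fields whose wording varies from document to document, a classifier or a custom model is usually the better fit.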

1282 questions
-4
votes
1 answer

Words extraction

I have a txt file structured like this: *ABC123 a ABC123 , / comment here DEF452 hju then ABC123 h DEF452 hh, 6HH , 7JJ , 8LL, Z123 enf of file \ \ *ABC124 a ABC124 , / comment here DEF4552 hju then ABC124 h DEF4566 hh, …
Luke
-5
votes
2 answers

Regex as dictionary to extract numbers from 1 to 6 in R

I want to create a regex function that takes the codes and uses them as a reference dictionary to parse the corpus and put them into a TDM with their occurrences: corpus<- Corpus(DirSource(path)) dictionary <- regexpr(("") , corp) …
user10181179
-5
votes
1 answer

Extract email id from text file by showing path

My text file, which contains my resume, is saved on my desktop. I want to extract my e-mail address from it. Can someone help me? My txt file has: Name : K. Spandana Gender : Female Nationality…
SAKETH
-7
votes
2 answers

How to extract a specific portion of text from a text file in R?

I have many text files containing the text given below. \\ Paper: hep-th/9201003 From: DIJKGRAAF%IASSNS.BITNET@pucc.PRINCETON.EDU Date: Thu, 2 Jan 92 14:06 EST (54kb) Title: Intersection Theory, Integrable Hierarchies and Topological Field…
Alvi
-7
votes
4 answers

How to extract string using C#

I need to extract the Password value shown in bold (Password10) from the text given below. I am using the C# programming language. FName Lname, your system password was changed. If you did not change it or do not know why it was changed, you should…