Questions tagged [text-extraction]

Text extraction is the task of automatically extracting structured information from unstructured and/or semi-structured machine-readable documents (text).

Text extraction mechanisms vary with the context and the language used. Approaches range from regular expressions and classifiers to more complex, custom models.
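As a rough illustration of the simplest end of that range, the sketch below uses a regular expression to turn loosely structured text into a dictionary. The sample text, the `Name: number` layout, and the helper name `extract_pairs` are assumptions made for the example, not part of the tag description.

```python
import re

# Hypothetical input: one "Name: number" pair per line (assumed layout).
TEXT = """Neymar: 11
Messi: 10
Ronaldo: 7"""

# With re.MULTILINE, ^ and $ match at the start and end of every line.
PAIR = re.compile(r"^(\w+):[ \t]*(\d+)[ \t]*$", re.MULTILINE)


def extract_pairs(text):
    """Return a {name: number} dict for every line that matches PAIR."""
    return {name: int(value) for name, value in PAIR.findall(text)}


print(extract_pairs(TEXT))
```

Lines that do not match the pattern are skipped silently, which is usually acceptable for quick scripts but can hide malformed input.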

1282 questions
-2
votes
3 answers

Extraction of unique values from an array list

I'm pretty new to programming in Java and I want to make a program that will print out some values from a file. I want to import an array list from a file which contains a large set of repeated numbers. The program should print out only one unique…
Ivn Bubrov
  • 43
  • 1
  • 8
-3
votes
0 answers

Trying to extract handwritten text from an image using a deskew function. I've been facing this issue for the past 3 days

[Can anyone please help me out with this? I can't get it.](https://i.stack.imgur.com/Jpler.png) I need to extract text from an image, so I downloaded a handwritten text sample to test whether it works. Then I got an error like (-215 assertion), but…
Monika
  • 1
-3
votes
2 answers

How to extract and replace text by using a Linux Script

file: input_file.xml ACTIVE Active Role DEFAULT Default Role I need to extract all the text between…
kikin
  • 1
  • 1
-3
votes
3 answers

JavaScript RegEx : Extract the first matched value from a Multiline Text

Well, I've a Multiline Text like below. I want to extract the first Matched Content value (Here, 10) after Messi: using JavaScript Regular Expressions. The Matching String (Messi: ) Can be on any line. Neymar: 11 Messi: 10 Ronaldo: 7 Chhetri:…
Jenson M John
  • 5,499
  • 5
  • 30
  • 46
-3
votes
1 answer

If you had text file full of people's names where each person's name was on a separate line, how would you turn that into a clean list of their names?

How do you quickly extract people's names from a text string with a Python script? A General Description: For any person p, if two lines L1 and L2 contain the name of person p, then line L1 is the same line as line L2. For any two different…
Toothpick Anemone
  • 4,290
  • 2
  • 20
  • 42
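One possible reading of this question is "normalise each line and keep every name once". The sketch below follows that reading. The helper name `clean_names`, the whitespace collapsing, and the case-insensitive de-duplication are all assumptions, not requirements stated in the excerpt.

```python
def clean_names(lines):
    """Return names in first-seen order, without blanks or repeats."""
    seen = set()
    names = []
    for line in lines:
        # Collapse runs of whitespace and trim the line ending.
        name = " ".join(line.split())
        if not name:
            continue  # skip empty lines
        key = name.casefold()  # assumption: "ada" and "Ada" are the same person
        if key not in seen:
            seen.add(key)
            names.append(name)
    return names


print(clean_names(["  Ada Lovelace \n", "\n", "Alan  Turing\n", "ada lovelace\n"]))
```

Because the function accepts any iterable of strings, an open file object can be passed to it directly.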
-3
votes
2 answers

How can I extract sentences with certain text in a spreadsheet?

I have a spreadsheet which looks like this. I would like to keep the file column, but extract only the sentences with the word "India". Is there a way to do that? I'd prefer to use KNIME or R, but I'm happy with any solution. Only the sentences with "India"…
-3
votes
1 answer

What format is easiest ("most lossless") to parse in Python? XML, EPUB, XHTML or PDF?

I need to convert a few large documents to a database and I have the files in xml, xhtml, epub and pdf. Assuming the files themselves are completely faultless, which of these formats will enable me to extract the text with the least mistakes and…
Olli
  • 906
  • 10
  • 25
-3
votes
1 answer

PHP regex preg_match only amount

Hello world, I have a problem with regex: I want to get some parts of a string. Example 1: here I want to get the amount "1234.0", the date "21-Jun-2021", and also the MB number "20010598278" 21-Jun-2021 Bank Fees - Funds Transfer - MB #20010598278…
-3
votes
1 answer

Python extract parameters from url

I need your help on this. I have a URL something like this: url = "https://tracking.example.com:443/attribution_tracking/conversions/1980.js?p=https://example.com/search?addsearch=test+search&e=" I need some Python code to extract the URL parameters…
Eric John E.
  • 73
  • 1
  • 1
  • 7
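A minimal sketch of one way to approach this with the standard library's `urllib.parse`, using the URL from the excerpt. The helper name `query_params` is hypothetical, and because the nested URL in `p` is not percent-encoded, treating everything up to the next `&` as the value of `p` is an assumption about what the asker wants.

```python
from urllib.parse import parse_qs, urlsplit

URL = (
    "https://tracking.example.com:443/attribution_tracking/conversions/1980.js"
    "?p=https://example.com/search?addsearch=test+search&e="
)


def query_params(url):
    """Return the query string of url as a {name: [values]} dict."""
    query = urlsplit(url).query
    # keep_blank_values=True keeps parameters such as "e=" that have no value.
    return parse_qs(query, keep_blank_values=True)


params = query_params(URL)
print(params["p"][0])
print(params["e"][0])
```

Note that `parse_qs` decodes `+` to a space, so the nested search term comes back as `test search`. If the raw value is needed, splitting the query string manually may be a better fit.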
-3
votes
1 answer

How do I extract a particular parameter and its data from multiple text files?

Suppose I have multiple files such as file1, file2 and file3, which consist of the following data: file1: {"status":"succes","message":"User Found","error_code":"0","data":{"phone":0,"name":"Sanju…
D - Lord
  • 3
  • 2
-3
votes
2 answers

C# Extracting String and Double from line of string

I have a line from which I need to extract a Name and the result (double) of that name. The line looks like this: James: 8, John: 8, Jasmin: 12, Igor: 1.54, Garry: 0, Gabe: 0.12, Lauren: 0, Grace: 81.31 I need each of them as an array of Name and…
Oran Band
  • 1
  • 1
-3
votes
2 answers

How to extract the content of a text file within a scope delimited by a string marker

I have a console file. I need to match a string ("Seq Started"), and if I find it, I want to copy all the text up to another string ("Seq Ended") into a txt file.
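A minimal sketch of the marker-based approach described here, written in Python since the question names no language. The helper name `extract_between` is hypothetical, and the sketch assumes that only the first block is wanted and that the marker lines themselves should be left out.

```python
def extract_between(lines, start="Seq Started", end="Seq Ended"):
    """Return the lines after the first start marker, up to the next end marker."""
    captured = []
    inside = False
    for line in lines:
        if not inside:
            # Still searching for the start marker; skip everything before it.
            if start in line:
                inside = True
            continue
        if end in line:
            break  # stop at the first end marker
        captured.append(line)
    return captured


log = ["boot\n", "Seq Started\n", "step 1\n", "step 2\n", "Seq Ended\n", "idle\n"]
print(extract_between(log))
```

To work on real files, the function can be given an open file object, and the returned lines can be written to the output file with `writelines`. If the start marker never appears, the result is an empty list.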
-3
votes
1 answer

I want to extract a certain paragraph from a text-type website using a for loop

website = https://www.sec.gov/Archives/edgar/data/3662/0000950170-98-000413.txt I want to extract the paragraph titled "MANAGEMENT'S DISCUSSION AND ANALYSIS OF FINANCIAL CONDITION AND RESULTS OF OPERATIONS" in text format in Python from the given…
-3
votes
1 answer

Is there any way to do partial String matching in R?

I have 2 data frames. The first has more rows and an ID column "ALP23456" along with other related columns. The second has fewer rows, and the ID value is present as a comment "ALP23456 done on 26th March". This is free text with no pattern…
Koyeli
  • 67
  • 1
  • 9
-3
votes
2 answers

How to extract data from code using Python

I have some functions, stored as .txt files and written in EasyLanguage, and I need to extract data from those functions using Python. As an example, consider the following part. code segment - If MarketPosition = 0 and (EntriesToday(Date) < 1 or…