Questions tagged [text-processing]

Mechanizing the creation or manipulation of electronic text.

Text processing includes basic jobs that use filtering, tokenization, or normalization methods to process text. This could be a pre-processing step.

1959 questions
0
votes
2 answers

awk to compare time field for 24 hours data

I have the test data below and have been able to get the data for the current date. How can I compare the time field to get only the data from 00:00:01 to 08:00:00? Test1 prog 02/20/2017 03:45:56 Success Test2 prog 02/20/2017 05:21:38 Fail Test3 prog 02/20/2017 …
Sid
  • 161
  • 1
  • 10
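
One possible approach to the time-window question above, sketched in Python rather than awk. It assumes each record is whitespace-separated with the time in the fourth field, as in the sample rows; the function name `in_window` and the third sample row are made up for illustration.

```python
from datetime import datetime, time

# Window bounds taken from the question: 00:00:01 to 08:00:00, inclusive.
START = time(0, 0, 1)
END = time(8, 0, 0)

def in_window(line, start=START, end=END):
    """Return True if the 4th whitespace-separated field is a time inside the window."""
    fields = line.split()
    if len(fields) < 4:
        return False
    try:
        stamp = datetime.strptime(fields[3], "%H:%M:%S").time()
    except ValueError:
        # Field is not a valid HH:MM:SS time, so skip the record.
        return False
    return start <= stamp <= end

sample = [
    "Test1 prog 02/20/2017 03:45:56 Success",
    "Test2 prog 02/20/2017 05:21:38 Fail",
    "Test9 prog 02/20/2017 14:10:00 Success",  # hypothetical row outside the window
]
matches = [line for line in sample if in_window(line)]
```

Parsing the field into a `time` object keeps the comparison independent of string formatting, though zero-padded `HH:MM:SS` strings would also compare correctly as plain text.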
0
votes
1 answer

N combinations of words, simple bash

File 1: 1F 2F 3F 4F 5f File 2: 1F 2F 3F 4F 5f I have a simple code that produces all possible combinations #!/bin/bash for a in $(awk '{print $1}' intf1) do for b in $(awk '{print $1}' intf2) do echo -e…
SysRq308
  • 125
  • 4
0
votes
0 answers

Is there a similar command in R to ' diff -y ' command in Linux?

As a newbie in R, I am wondering whether there is any command or package in R that works like this Linux command: $ diff -y file01 file02 The -y flag is important because it compares the files side by side. I looked into some already…
Daniel
  • 1,202
  • 2
  • 16
  • 25
0
votes
1 answer

Create a new annotator in Stanford CoreNLP

I need to create and add a new annotator for analysing text in Stanford CoreNLP. There are solutions for adding a new annotator, but I can't find how to create one. I need to create a new annotator for Newsfeed source names in Stanford CoreNLP…
Tariq
  • 2,274
  • 4
  • 24
  • 40
0
votes
3 answers

VB.net loop and Split function text processing

I have a string variable as: p_text = "}{X=45,Y=65}{X=59,Y=65}{X=59,Y=79}{X=45,Y=79}{" with this code Dim p_text As String Dim s1_texts As String() Dim s2_texts As String() Dim s3_texts As String() s1_texts = Split(p_text, "}{") Dim numi As…
nsssayom
  • 364
  • 1
  • 5
  • 21
0
votes
3 answers

How to Return Everything After 2nd Occurrence of String

I was wondering what the best way to get everything after the second occurrence of a string would be. I have files like this: --- title: Test Document creation_date: 01-29-2016 --- Text, blah blah blah More text, blah blah blah So I have file…
tedm1106
  • 127
  • 1
  • 10
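
A minimal Python sketch for the question above, assuming the marker is the literal string `---` and that the front matter is laid out one item per line (the excerpt is flattened, so the exact layout is a guess). The helper name `after_second` is hypothetical.

```python
def after_second(text, marker="---"):
    """Return everything after the second occurrence of marker, or '' if there is none."""
    first = text.find(marker)
    if first == -1:
        return ""
    second = text.find(marker, first + len(marker))
    if second == -1:
        return ""
    # Drop the newline that follows the closing marker line, if any.
    return text[second + len(marker):].lstrip("\n")

document = (
    "---\n"
    "title: Test Document\n"
    "creation_date: 01-29-2016\n"
    "---\n"
    "Text, blah blah blah\n"
    "More text, blah blah blah\n"
)
body = after_second(document)
```

Note that this searches for the marker anywhere in the text; a value such as `01-29-2016` is safe because it never contains three consecutive hyphens.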
0
votes
0 answers

How to parse a file for multiple strings?

I have a large number of HTML files, some of which contain a section starting with "specific text" and ending with its closing tag. I'd like to use a bash script to remove these sections. There are many other div sections, some of which overlap with the one…
0
votes
1 answer

How to use Mashape with the Python requests library?

Unirest is not compatible with Python 3, and that's the library that Mashape APIs use in Python projects. I've decided to use the Python requests library to make a POST request, but I'm getting a 400 HTTP error. Everything looks good to me, but I can't…
0
votes
1 answer

Does a naive Bayes classifier perform text annotation?

Does a naive Bayes classifier perform text annotation (sequence tagging)? If yes, I need a tutorial, please. What do you think about MALLET for naive Bayes?
Fasun
  • 73
  • 10
0
votes
1 answer

appending string with integer values python re

I was trying to insert a date into a string, using regular expression in python link = 'branch=;deps=;date=;rev=;days=1;user=' date = "10.12.2016" re.sub(r'(.*)(date=[^;]*)(.*)','\\1\\2'+date+'\\3',link) I was expecting the output to be…
Gopikrishnan R
  • 11
  • 1
  • 1
  • 4
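
A likely cause in the question above: `'\\2' + date` builds the template text `\210.12.2016`, so the digits of the date run straight into the backslash escape and it can no longer be read as "group 2 followed by 10". The expected output is cut off in the excerpt, so the sketch below assumes the goal is to place the date right after `date=`. It uses the `\g<n>` form of group reference from Python's `re` module, which ends the group number explicitly.

```python
import re

link = 'branch=;deps=;date=;rev=;days=1;user='
date = "10.12.2016"

# \g<2> closes the group reference, so the date's leading digits
# cannot be absorbed into the group number.
result = re.sub(
    r'(.*)(date=[^;]*)(.*)',
    r'\g<1>\g<2>' + date + r'\g<3>',
    link,
)
```

Passing a function as the replacement argument is another way to avoid template parsing altogether.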
0
votes
1 answer

convert ruby array to regular string

I have an array like this: text_arr = ["hello","how","are","you"] and I want to convert it to a string like this: text = "hello how are you" How can I do this with Ruby?
watermelon
  • 33
  • 6
0
votes
1 answer

How to check if a file contains a specific set of characters (ex: ^&^)

I have a file which is delimited with ^&^. Here is a snippet from the file. XML_DOC^&^NUM^&^GEO_REF_ID^&^GRL I need to perform some operations based on the delimiter. How can I check if the file has ^&^ ? I have tried the below code but that did…
Alex Raj Kaliamoorthy
  • 2,035
  • 3
  • 29
  • 46
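
A small Python sketch for the delimiter check above. Since `^` is a regular-expression metacharacter, a plain substring test sidesteps any escaping. The helper name `file_contains`, the temporary file, and the alternative delimiter `|~|` are illustrative assumptions; the code the asker tried is not shown in the excerpt.

```python
import os
import tempfile

def file_contains(path, needle="^&^"):
    """Return True if any line of the file contains the literal needle."""
    with open(path, encoding="utf-8") as handle:
        return any(needle in line for line in handle)

# Demo on a temporary file holding the header line from the question.
with tempfile.NamedTemporaryFile(
    "w", suffix=".txt", delete=False, encoding="utf-8"
) as tmp:
    tmp.write("XML_DOC^&^NUM^&^GEO_REF_ID^&^GRL\n")

try:
    found = file_contains(tmp.name)
    missing = file_contains(tmp.name, needle="|~|")  # hypothetical other delimiter
finally:
    os.remove(tmp.name)
```

Reading line by line keeps memory use flat on large files and stops at the first hit.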
0
votes
1 answer

swapping variable name and variable type

I have a text file with a long list of variables like this: a VARCHAR(32), b INT, c TINYINT, . . . I want to quickly swap the order of the name and type so I have: VARCHAR(32) a, INT b, TINYINT c . . . I'm happy to use a bash terminal or notepad…
Connor Bishop
  • 921
  • 1
  • 12
  • 21
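
One way to do the swap above with a regular expression, sketched in Python. It assumes every declaration has the form `name TYPE` or `name TYPE(args)` with single-word names and types; the pattern and the function name `swap_name_and_type` are illustrative, not taken from the question.

```python
import re

# Group 1: the variable name. Group 2: the type, with an optional
# parenthesised argument list such as (32).
DECLARATION = re.compile(r"(\w+)\s+(\w+(?:\([^)]*\))?)")

def swap_name_and_type(text):
    """Rewrite each 'name TYPE' pair as 'TYPE name', leaving separators untouched."""
    return DECLARATION.sub(r"\2 \1", text)

swapped = swap_name_and_type("a VARCHAR(32), b INT, c TINYINT")
```

Multi-word types (for example a type followed by a modifier) would need a wider pattern, so it is worth checking the output against a few real lines first.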
0
votes
1 answer

Put file names into text

I have at least 40 files (20 in .xml format and 20 in .safe format). For example: iw1-20150612.SAFE.safe iw2-20150714.SAFE.safe iw1-20150612.xml iw2-20150714.xml I want to put these file names in a txt file in a certain place of text via…
user2899758
  • 157
  • 7
0
votes
1 answer

Merge all .wav files in a folder that start by the same prefix

I have filelists that look like…
Ruan
  • 65
  • 1
  • 1
  • 9