Questions tagged [mining]
235 questions
0
votes
1 answer
Questions about specific R code in text mining and seeking some suggestions
I have recently started doing text mining, and from reading the code I understand the overall picture of what it does with the text.
But in some specific parts of the code, I don't know why the format is this way, and what the parameters…

user43480
0
votes
1 answer
Marking or Tagging Unstructured Data (Text Data) with Combinations of Words
I am working on unstructured data (text).
I want to tag the data with keywords and combinations of keywords.
I am not able to tag the data with combinations of words. I want to know where "fraud" AND "misselling" both occur.
I…

Abhishek Kapoor
0
votes
2 answers
RapidMiner XPath extract forum post text
I am new to RapidMiner, and for an academic project I am trying to extract text from forum posts such as http://www.tripadvisor.com/ShowTopic-g29220-i86-k1487815-Alamo-Maui_Hawaii.html
I have tried a lot already and came up with the XPath…

hag o hi
0
votes
1 answer
Text mining for filtering search
I am developing a question answering system in Java, in which I have manually created templates that will be matched against the user's question.
The problem is that after preprocessing I have a list of keywords, and I want to match these keywords with the keywords in stored…

blueWings
0
votes
0 answers
Extract Google search URLs using R and extract text from each URL
I need to extract Google search result URLs and then extract the text from each of those URLs using R.
The goal is to classify each website that appears in the Google search results.
Please help if anyone knows how.

user1925750
0
votes
1 answer
Web hosting does not support cron jobs. How can I run my scraping script remotely to mine data and then send it to my hosting's MySQL database?
I am currently using Yahoo web hosting. I found out that Yahoo does not support cron jobs. What I want to do is run my script and feed the data it mines into Yahoo's MySQL database. The script is written in PHP, takes a couple of hours to complete, and…

Charity
0
votes
1 answer
How do you convert all the PDFs in a directory into txt format via R?
I'm trying to convert a list of PDF files located in a directory on my computer into txt format so that R can read them and begin text mining. Do you know what is wrong with this code?
library(tm) #load text mining library
setwd('D:/Directory') #sets R's…

stochastiq
0
votes
0 answers
How do I preserve table structure after using TM to clean up a CSV file?
I need some help with one task in TM. I have a CSV file with ~300 rows and 42 variables, some of which have NA values. I want to use TM to clean this file up before I load the data into an NLP application. Specifically, I want to…

Jose Morales
0
votes
1 answer
CGMiner error: SEM: Icarus USB timeout waiting for
I set up CGMiner 3.8.1 (and also the version before it) on Mac OS X 10.8.5.
I have 11 Block Erupters, but CGMiner sees only 10 and keeps giving this error: SEM: Icarus USB timeout waiting for (blah blah).
I already changed the USB hub, plugged the USBs…

Igor Almeida
0
votes
2 answers
Search Twitter and collect search results from 'Mining the Social Web' examples
I'm reading the code for 'Mining the Social Web, 2nd Edition' here, and I'm trying to understand how example 6 works.
I'm trying to print the length of statuses, and it outputs different results. Below I will show two code snippets and the…

tkyass
0
votes
0 answers
stratum-mining: how to set up
Here are some lines of a configuration file for the Stratum protocol:
parser.add_argument('-o', '--host', dest='host', type=str, default='198.20.177.17', help='Hostname of Stratum mining pool')
parser.add_argument('-p', '--port', dest='port',…

zuperakos
0
votes
1 answer
R web/text mining - web query JSON read
In the Blekko search engine you can get the search results in JSON format, e.g. with the search term 'lifehacker':
http://blekko.com/ws/?q=lifehacker+%2Fjson
How could you carry out this query from R and parse the content?
[There is a URL, an RSS…

Henk
0
votes
1 answer
Columnstore Indexing technology in MySQL with vector based batch processing
I am using MySQL as a database, and because the database is large, it has been indexed column-wise. But now there is a requirement to use an MSSQL-like method for indexing, as it is said to perform faster; the link is as…

KCP
0
votes
1 answer
How to find ranges in continuous values for use in building a decision tree
I am building a decision tree that uses fields with continuous values (doubles).
How should I create the range nodes to build the tree (that is, how do I find the best range values)?

user882611
0
votes
1 answer
How to build a keyword-related graph from an initial input keyword?
As a beginner text miner, I would like to ask for advice and guidelines on graph mining, based on a real need of mine: building a keyword-related graph from an initial input keyword.
I know the topic is relatively large, so I want to do it for Twitter…

karmiphuc