Questions tagged [mining]

235 questions
0
votes
1 answer

Questios about Specific code of R in text mining and seeking some suggestions

I am trying to do text mining recently and Seeing the code, I have the whole picture about what it's trying to do about the text. But the problem is on some specific part of code, I don't know why the format is this way, and what the parameters…
user43480
  • 11
  • 2
0
votes
1 answer

Marking or Tagging Non-structured Data (Text data) with combination of words

I m working on unstructured data (text). I want to tag back the data with some key words and combination of key words. I am not able to tag back the data with the combination of words. I want to know where "fraud" AND "misselling" is occurring. I…
Abhishek Kapoor
  • 51
  • 1
  • 1
  • 3
0
votes
2 answers

Rapidminer XPath extract forum post text

I am new to repidminer and for an academic project I am trying to extract text information from forum posts such as http://www.tripadvisor.com/ShowTopic-g29220-i86-k1487815-Alamo-Maui_Hawaii.html I have tried a lot already and came up with the XPath…
hag o hi
  • 117
  • 1
  • 1
  • 9
0
votes
1 answer

text mining for filtering search

I m developing question answering system in java,in tht I have created templates manually which will be match to user asked question. Problem is after pre processing i have list of Keywords and these keywords I want to match with keywords in stored…
blueWings
  • 71
  • 1
  • 9
0
votes
0 answers

Extract Google search URLs using R & Extract Texts from each URLS

I need to extract Google search URLs & Extract Texts from those each URLs using R. The goal is to classify each websites that appears from the Google search. Please help if any one knows how.
0
votes
1 answer

Web hosting does not support cron jobs. How can I run my scraping script remotely to mine data and then send it to my hosting's mysql database?

I am currently using Yahoo web hosting. I found out that Yahoo does not support cron jobs. What I want to do is run my script and feed the data it mines into Yahoo's mysql database. The script is written in PHP, takes a couple hours to complete, and…
Charity
  • 1
  • 1
0
votes
1 answer

How do you convert all the pdfs in a directory, into txt format, via R?

I'm trying to convert a list of PDF files located in my computer directory, into txt format so that R can read it and begin text mining. Do you know what is wrong with this code? library(tm) #load text mining library setwd('D:/Directory') #sets R's…
stochastiq
  • 269
  • 3
  • 17
0
votes
0 answers

How do I preserve table structure after using TM to clean up a CSV file?

I am in need of some help with TM with one task. I have a csv file with ~300 rows and 42 variables with some variables having NA values. I want to use TM to clean this file up before I load the data to a NLP application. Specifically, I want to…
0
votes
1 answer

CGMiner error: SEM: Icarus USB timeout waiting for

I set up CGMiner 3.8.1 (even the one before this version) on MACOS X 10.8.5 I have 11 Block Erupters but the CGMiner just see 10 and keep giving this error: SEM: Icarus USB timeout waiting for (blah blah). I already changed usb hub, plugged the usbs…
0
votes
2 answers

search twitter and collect search results from 'mining the social web' examples

I'm reading the code for 'mining the social web 2nd E' on here and I'm trying to understand how example 6 is working! I'm trying to print the length of statuses and is outputting different results, below I will display two code snippets and the…
tkyass
  • 2,968
  • 8
  • 38
  • 57
0
votes
0 answers

stratum-mining how to setup

here are some lines of a configuration file for stratum protocol: parser.add_argument('-o', '--host', dest='host', type=str, default='198.20.177.17', help='Hostname of Stratum mining pool') parser.add_argument('-p', '--port', dest='port',…
zuperakos
  • 355
  • 4
  • 18
0
votes
1 answer

R web/text mining - web query JSON read

In the Blekko search engine you can get the search results in JSON format, e.g. with the search term 'lifehacker': http://blekko.com/ws/?q=lifehacker+%2Fjson How could you carry out this query from R and parse the content? [There is a URL, a RSS…
Henk
  • 3,634
  • 5
  • 28
  • 54
0
votes
1 answer

Columnstore Indexing technology in MySQL with vector based batch processing

I am using MySQL as a database and as the database is large, it has been indexed columnwise. But now there is the requirement to use the MSSQL like method for indexing as it has been said to perform faster whose link is as…
KCP
  • 929
  • 9
  • 29
0
votes
1 answer

how to find ranges in continuous values for use in building decision tree

I am building a decision tree that uses fields with continuous values (doubles). how should I create the range nodes to build the tree with (finding the best ranges values)
user882611
  • 47
  • 1
  • 6
0
votes
1 answer

How to build a keyword-related graph from an initial input keyword?

As a beginner text-miner, I really want to ask for advices/guidelines on graph mining, based on a real need of me: build a keyword-related graph from an initial input keyword. I know the topic is relatively large, so I want to do it for Twitter…
karmiphuc
  • 81
  • 1
  • 9