Questions tagged [mining]

235 questions
0
votes
0 answers

Text lemmatization in pyspark returns a TypeError: 'Column' object is not callable

I spent a lot of time trying to find the solution to this one. I am working with pyspark on a dataframe with a text column that I tokenized, and I am trying to lemmatize it using nltk, but this raises an error: TypeError: 'Column' object is not…
k_bm
  • 81
  • 1
  • 10
0
votes
1 answer

Can't collect website's source code when CSS is disabled

Thanks for checking out this question! I'm teaching myself how to collect web data. The objective is to collect reviews of 'booking(dot)com' listings within a city. I'm using the requests library to collect the source code and find useful data.…
nnarefun
  • 89
  • 1
  • 7
0
votes
2 answers

geth states eth_submitHashrate while mining with Claymore on Windows 10 with 2 GPUs

I am aiming to GPU-mine Ethereum on a Windows 10 PC with 2 Radeon RX590 cards. The geth version is 1.9.9-stable-01744997. The cmd call to start geth: geth --rpc --syncmode "fast" --cache 4096 --etherbase [ADR] --datadir "[MyDataDir]" --mine --minerthreads…
user3440145
  • 793
  • 10
  • 34
0
votes
1 answer

What is the best config for xmr-stak to mine Monero?

My tech spec is dual Intel Gold 6140 (36 cores, 2.3 GHz), 96 GB RAM, two 800 GB SSDs (RAID) and two NVIDIA V100 32 GB cards. (I don't have access to the BIOS to overclock.) I was wondering what would be the best config and setup to get the most hashes? Currently…
0
votes
1 answer

Geth (go-ethereum) uses 100% CPU during mining even when 1 thread is specified

Steps to reproduce: Run geth with parameters --mine --minerthreads "1" or --mine --minerthreads 1 Expected behaviour: Only 1 thread is used. Actual behaviour: All CPU threads (8) are used with a 100% load. System information: Geth version: 1.9.6 OS…
0
votes
1 answer

Merging tags into my file using named entity annotation

While learning the basics of text mining I ran into the following problem: I must use named entity annotation to find and locate named entities. However, when found, the tag must be included in the document. So for example: "Hello I am Koen" must…
Koen Wijnen
  • 55
  • 1
  • 8
0
votes
2 answers

Has anyone used Dijkstra's algorithm in OPL?

I have a model for a mining problem. I am working on extending the model to use the shortest path inside a mine (open pit) for hauling ore and waste. For this, I was thinking of Dijkstra's algorithm. I could not find any example of the use of…
Ranajit
  • 49
  • 6
0
votes
2 answers

I can't seem to count the frequency of my itemsets

I have transaction data and I'm trying to get a count of all the possible combinations. The problem is that it seems to overcount my combinations. For example, given the following item sets: A {1,2,3} B {1,2,3,4}, if I want to count…
semidevil
  • 69
  • 1
  • 11
0
votes
1 answer

Decreasing support threshold for arules in R

I am working on association rules that are considered outliers. I noticed that arules does not show results for rules that have a support less than .10. Is there any way that I could view rules that have a support of .1 (10%) or less? I tried the…
aduran3
  • 3
  • 2
0
votes
1 answer

I want to parse multiple HTML documents with beautiful soup but I can't make it work

Is there a way to use beautiful soup to parse multiple HTML documents at the same time? I am modifying code found online that extracts HTML.txt files from EDGAR with beautiful soup so they can be downloaded as formatted files; however, I found that my…
0
votes
0 answers

What do I have to do after computing tf-idf and chi-square?

In text mining, what do I have to do after computing tf-idf and chi-square for terms? The tf-idf output is in a different file from the chi-square output, right? Is it correct to multiply the tf-idf matrix by the chi-square weight of every term?
0
votes
2 answers

Extracting part of string based on two conditions

I have a character column in my data set from which I want to extract part of a string based on two conditions: a) if the string starts with "Therapist:", split the string into two columns: one column with the word "Therapist" and the other column…
0
votes
1 answer

Tweepy still not returning full text despite using extended text feature

I am using tweepy to download tweets about a particular topic, but no matter which tutorial I follow, I cannot get the tweet to output as a full tweet. There is always an ellipsis that cuts it off after a certain number of characters. Here is the code I…
0
votes
0 answers

How to fix the colors of a 2D plot of tf-idf data in Python?

I have tried the code from this link, "plot a document tfidf 2D graph": from sklearn.feature_extraction.text import CountVectorizer, TfidfTransformer from sklearn.decomposition import PCA from sklearn.pipeline import Pipeline import matplotlib.pyplot as…
yyywd
  • 21
  • 5
0
votes
1 answer

Find Hidden Miners in Go (Hidden windows + commandlines)

I found this C# and I want to improve on it in Go: https://github.com/roachadam/MinerKiller/blob/master/MinerKiller/MinerKiller.cs My first question is: how do I detect if a process window is hidden? I.e., this code: if (p.MainWindowHandle ==…
dom
  • 321
  • 2
  • 12