Questions tagged [sframe]

SFrame is a tabular data structure used to store millions of rows of data that do not necessarily fit into memory for analysis.

SFrame is Python package realized scalable, out-of-core dataframe which allows to work with datasets that are larger than the amount of RAM on your system.

Homepage: https://pypi.python.org/pypi/SFrame

82 questions
1
vote
2 answers

What is use of SFrame.materialize() in Graphlab?

When I was trying to get the rows of my dataset belonging to column of userid =1 through graphlab's sframe datastructure, sf[sf['userid'] == 1], I got the rows,however I also got this message, [? rows x 6 columns] Note: Only the head of the SFrame…
hgr
  • 178
  • 1
  • 1
  • 10
1
vote
1 answer

Cannot change the None values in an SFrame using apply()

I am using this code to change the None values in the SFrame: sarr['value'] = sarr['value'].apply(lambda x: '-1' if x is None else x) After running this code, I still see the same None values, no effect. Any ideas what is going on here?
renakre
  • 8,001
  • 5
  • 46
  • 99
1
vote
2 answers

how to take average of the values per three days in a timeseries data

I have a time-series data that look like this: datetime | value 2016-01-01 | 1 2016-01-02 | 14 2016-01-03 | 3 2016-01-04 | 15 2016-01-05 | 5 2016-01-06 | 4 2016-01-07 | 7 2016-01-08 | 15 What I want to obtain is the average per three…
renakre
  • 8,001
  • 5
  • 46
  • 99
1
vote
2 answers

Using regex to extract information from a large SFrame or dataframe without using a loop

I have the following code in which I use a loop to extract some information and use these information to create a new matrix. However, because I am using a loop, this code takes forever to finish. I wonder if there is a better way of doing this by…
renakre
  • 8,001
  • 5
  • 46
  • 99
1
vote
1 answer

SFrame manipulation slows down after adding of a new column

I am building a repeat orders report in ipython notebook using graphlab and sframes. I have a csv file with roughly 100k rows of data containing user_id, user_email, user_phone. I added a new column called unique identifier. For each row I am…
Shami
  • 11
  • 2
1
vote
1 answer

Converting a unique columns into SFrame headers with corresponding values

I have a tab-separated file: $ echo -e 'abc\txyz\t0.9\nefg\txyz\t0.3\nlmn\topq\t0.23\nabc\tjkl\t0.5\n' > test.txt $ cat test.txt abc xyz 0.9 efg xyz 0.3 lmn opq 0.23 abc jkl 0.5 $ python >>> from sframe import SFrame >>> sf =…
alvas
  • 115,346
  • 109
  • 446
  • 738
1
vote
0 answers

Graphlab Sframe - Remove an empty column

I have many sframes each having around 25 columns, I need to run many different classifiers on these. But, there are few columns which would be completely empty for particular Sframes. I need to get a list of columns which are empty for particular…
Dreams
  • 5,854
  • 9
  • 48
  • 71
1
vote
3 answers

Error while loading graphlab.SFrame('home_data.gl/')

I am doing Machine Learning Course from Coursera by University of Washington. In which I am using iPython's graphlab. During practise when I execute below command: sales = graphlab.SFrame('home_data.gl/') I am getting error. IOError …
Chinesh
  • 125
  • 1
  • 11
1
vote
1 answer

SFrame from numpy array

I would like to create an SFrame from a NumPy array. What i want specifically is: np.arange(16).reshape(4, 4) => +----+----+----+----+ | 0 | 1 | 2 | 3 | +----+----+----+----+ | 0 | 1 | 2 | 3 | | 4 | 5 | 6 | 7 | | 8 | 9 | 10 | 11 | |…
Vladimir
  • 369
  • 1
  • 3
  • 12
1
vote
4 answers

python log() function TypeError

i have a data set called "sales" which is a SFrame. there's a column called "sqft_living" and i want to convert it to log value. data type of column is float. but when i try to convert it with log() function it asks a float although its already a…
Sampath Rajapaksha
  • 111
  • 1
  • 1
  • 11
1
vote
1 answer

Select a specific Row in SFrame

I've been baffled on how to select a specific row within a SFrame array. I'm able to select the first row here: sf +-------------------------------+ | X1 | +-------------------------------+ | [0.0, 0.0, 0.0, 0.0, 0.0, ...…
jKraut
  • 2,325
  • 6
  • 35
  • 48
1
vote
1 answer

Fit scikit-learn algorithms with data stored in SFrame

Is it possible to use data stored in Sframe to train e.g., a Random Forest, of scikit-learn implementation without converting the whole dataset to numpy?
1
vote
1 answer

graphlab: How to load a gzipped file into SFrame

I would like to read a gzipped csv file () to SFrame provided by GraphLab. https://dato.com/products/create/docs/generated/graphlab.SFrame.read_csv.html I tried the following code, but it did not work. import graphlab import…
Benben
  • 1,355
  • 5
  • 18
  • 31
1
vote
1 answer

Python: Iterate an operation across different columns of one row for all rows of a graphlab.SFrame

There is a SFrame with columns having dict elements. import graphlab import numpy as np a = graphlab.SFrame({'col1':[{'oshan':3,'modi':4},{'ravi':1,'kishan':5}], 'col2':[{'oshan':1,'rawat':2},{'hari':3,'kishan':4}]}) I want to…
Oshan
  • 176
  • 15
1
vote
1 answer

Find particular rows in Graphlab or Python

In Graphlab, I am working with a small subset of movies from a larger list. movieIds_5K_np = LL_features_SCD_min.to_numpy()[:,0] ratings_33K_np = ratings_33K.to_numpy() movieIds_5K_np is an array containing my movieIds. `ratings_33K_np' is an…
Yas
  • 811
  • 4
  • 11
  • 20