Questions tagged [sframe]

SFrame is a tabular data structure used to store millions of rows of data that do not necessarily fit into memory for analysis.

SFrame is a Python package that provides a scalable, out-of-core dataframe, which lets you work with datasets that are larger than the amount of RAM on your system.

Homepage: https://pypi.python.org/pypi/SFrame

82 questions
0
votes
1 answer

GraphLab and SFrames

I am new to Python and I'm taking a machine learning course. I have little experience with GraphLab and SFrames. I'd like to know how to apply conditional and logical operations to data stored in SFrames. The following code and output might…
0
votes
1 answer

Error Sframe iPython

I'm attending an ML course on Coursera, and I'm having trouble opening a file with SFrame. When I try, this error appears: sf = graphlab.Sframe.read_csv('people.csv') ******AttributeError Traceback (most recent call…
Mastodon87
  • 325
  • 3
  • 6
  • 14
0
votes
1 answer

Using .apply() in Sframes to manipulate multiple columns of each row

I have an SFrame with the columns Date1 and Date2. I am trying to use .apply() to find the datediff between Date1 and Date2, but I can't figure out how to use the other argument. Ideally something like frame['new_col'] =…
javsckc
  • 1
  • 1
0
votes
4 answers

Using apply() method on SFrame issues

Background: I have an SFrame that contains numbers indicating how close a dog image is to other images. Usually the dog image should be closest to another dog image, but the point is to test the evaluation method. My SFrame is called dog_distances…
0
votes
1 answer

Describe and Info for SFrame

It would be nice to see a summary of the SFrame, something similar to what a pandas DataFrame gives you with the methods .info() and .describe(). What is the easiest way to do this other than sf.to_dataframe().info() and sf.to_dataframe().describe()? UPD: Where…
Vladimir
  • 369
  • 1
  • 3
  • 12
0
votes
2 answers

Python crashes with SFrame

I am taking a course on ML on Coursera, and I am using graphlab for it. When I execute the line below, Python crashes. Please help me solve this. I have no clue why it crashes every time: sf = graphlab.SFrame('people-example.csv')
ahairshi
  • 381
  • 3
  • 18
0
votes
2 answers

How to subset nan/inf values in Dato SFrames

I'm trying to subset a large data frame that has a couple of nan/inf values in one of its columns. I have tried, for example, something like this: df = df[df['a'] == 'NaN'] or df = df[df['a'] == 'Inf']. How do I reference these types of values…
J_Heads
  • 490
  • 2
  • 11
0
votes
1 answer

SFrame Kmeans - Convert to Int, Float, Dict

I'm preparing data to run KMeans from GraphLab, and am running into the following error: tmp = data.select_columns(['a.item_id']) tmp['sku'] = tmp['a.item_id'].apply(lambda x: x.split(',')) tmp = tmp.unpack('sku') kmeans_model =…
jKraut
  • 2,325
  • 6
  • 35
  • 48
0
votes
1 answer

Converting SFrames into input dataset Sframes

I have a pretty bad way to convert my input logs to the input dataset. I have an SFrame sf with the following format: user_id (int), timestamp (datetime.datetime), action (int), reasoncode (str). The action column takes 9 values ranging from 1 to…
Parisa Rai
  • 155
  • 2
  • 4
  • 9
0
votes
1 answer

Insert selected rows from one Sframe to another Sframe

I am aware of the append() function for appending all rows of one SFrame to another. But I want to insert a specific row from one SFrame into another. Is there a way to pick, say, only the 2nd row from Sframe1 and append it to SFrame2? Structure of Promocodes…
0
votes
5 answers

Python: Count of occurrences in dict from another list

I am trying to count the number of times a word occurs in a dict column, based on a subset of words of interest. First I import my data: products = graphlab.SFrame('amazon_baby.gl/') products['word_count'] =…
Trexion Kameha
  • 3,362
  • 10
  • 34
  • 60
0
votes
1 answer

Creating a lag variable in a Python Sframe by group

Hi, I am trying to identify the lag variable for a set of groups in a Python SFrame. The code I am using is: sf['lag'] = sf.groupby(['Group'])['Num'].shift(1) However, I am getting the error: TypeError: groupby() takes at least 3 arguments (2…
MJS
  • 1
  • 1
0
votes
0 answers

Treat missing data as just another category

I have some data which is mostly user demographics. There are a lot of survey questions that people have answered "yes" or "no". But the data naturally contains lots of missing values. I don't want to impute the missing values. I want to treat that…
Karup
  • 2,024
  • 3
  • 22
  • 48
0
votes
1 answer

Graphlab - OverflowError: Long too big to convert

I am creating different polynomial regression models by passing different powers of the same teaching feature. So if I want a polynomial model of degree 3 of the feature 'x', then I pass x^1, x^2 and x^3 to the regression model as the…
Mustafa Khan
  • 397
  • 1
  • 5
  • 13
0
votes
2 answers

Draw a boxplot using an sframe as data source

I'm practicing my ML Classification skills on The Billionaire Characteristics Database dataset. I'm using sframe for loading and manipulating the data and seaborn for visualization. In the process of data analysis, I wanted to draw a box plot…
iulian
  • 5,494
  • 3
  • 29
  • 39