Questions tagged [sframe]

SFrame is a tabular data structure used to store millions of rows of data that do not necessarily fit into memory for analysis.

SFrame is a Python package that provides a scalable, out-of-core dataframe, which lets you work with datasets larger than the amount of RAM on your system.

Homepage: https://pypi.python.org/pypi/SFrame

82 questions
1
vote
4 answers

Dato: What's the equivalent function for graphlab.random_split() in pandas?

I'm doing a course on Machine Learning on Coursera. In the course, it is emphasised that we use GraphLab from Dato. In one of the exercises, the instructor used graphlab.random_split() to split an SFrame, like this: sales =…
HaggarTheHorrible
  • 7,083
  • 20
  • 70
  • 81
1
vote
2 answers

AttributeError: 'module' object has no attribute 'Sframe'

I installed Dato's GraphLab Create to run with Python 2.7, first directly from its executable and then manually via pip for troubleshooting. Code: import graphlab graphlab.SFrame() Output: [INFO] Start server at:…
remi
  • 781
  • 2
  • 13
  • 22
1
vote
1 answer

How do I find specific rows that throw an error when saving in Graphlab SFrame?

I have an SFrame that looks like this with sf.print_rows(10): +--------------+---------------+-------+-------------------------------+ | Dataset | Domain | Score | Sent1 …
alvas
  • 115,346
  • 109
  • 446
  • 738
1
vote
2 answers

Matrix multiplication with SFrame and SArray with Graphlab and/or Numpy

Given a graphlab.SArray named coef:
+-------------+----------------+
|     name    |     value      |
+-------------+----------------+
| (intercept) | 87910.0724924  |
| sqft_living | 315.403440552  |
|   bedrooms  | -65080.2155528 |
|  bathrooms  |…
alvas
  • 115,346
  • 109
  • 446
  • 738
1
vote
1 answer

Cannot parse datetime in pm format with Boost, C++, graphlab

I've tried to convert a datetime string into a datetime in an SArray (which uses the C++ Boost library), but it does not seem to understand the %p format specifier. http://www.boost.org/doc/libs/1_43_0/doc/html/date_time/date_time_io.html This documentation…
AVX
  • 319
  • 1
  • 3
  • 13
0
votes
0 answers

How can I limit the number of results shown by .show() in turicreate

I'm trying to create a histogram to show how many times a song appears in an SArray. The histogram output contains all the songs in the column, and all the results are nearly invisible compared to the "Other" row, which shows all the not…
0
votes
1 answer

ValueError when calling activity_classifier.create(...) method

I am using TuriCreate to create a model to classify human activity, but I get an error when I try to run the activity_classifier.create(...) method. Code This is what I did: Load all data: train_sf = tc.SFrame("data/cleaned_train_sframe") valid_sf =…
Mushu
  • 31
  • 9
0
votes
0 answers

Graphlab and SFrame

I am taking a course on Machine Learning in which GraphLab and SFrame are used for loading and manipulating data. When I try to load data using SFrame, I get this error: AttributeError Traceback (most recent…
Ramzan Shahid
  • 167
  • 1
  • 4
0
votes
1 answer

Read csv to sframe in google colab

I created a new Python 2 notebook on Google Colaboratory and installed sframe (!pip install sframe), but I'm unable to read a CSV file into an SFrame. Any help?
0
votes
1 answer

Graphlab Sframes - How to retain all columns in groupby

I have an SFrame on which I want to do a groupby with some operator on a column. But this returns an SFrame with only the key columns specified. How can I do the operation on some columns but keep all the columns nonetheless?
Dreams
  • 5,854
  • 9
  • 48
  • 71
0
votes
1 answer

Problems installing SFrame package

At the moment I am having problems installing this package from GitHub: https://github.com/turi-code/SFrame. I use Windows 10 with Python 2.7. I tried pip install https://github.com/turi-code/SFrame, but it does not work. Then I tried to…
lnvrl
  • 1
  • 1
  • 3
0
votes
1 answer

Graphlab Sframe, retrieve multiple rows

I am trying to access multiple rows from a graphlab SFrame and convert them into a numpy array. I have a database fd of 96000 rows and 4096 columns and need to retrieve the row numbers that are stored in a numpy array. The method that I have come…
Nikolai K.
  • 77
  • 9
0
votes
2 answers

Transforming grouped data by converting grouping-categories into fields (using GraphLab or pandas DataFrame)

I have the following records grouped by the user_id and action columns.
user_id | action | count
1 | read | 15
1 | write | 5
1 | delete | 7
2 | write | 2
3 | read | 9
3 | write | 1
3 | delete | 2
I…
renakre
  • 8,001
  • 5
  • 46
  • 99
0
votes
0 answers

Why doesn't the text-cleaning function work without decoding to UTF-8?

I wrote the following function in python 2.7 to clean the text but it doesn't work without decoding the tweet variable to utf8 # -*- coding: utf-8 -*- import re def clean_tweet(tweet): tweet = re.sub(u"[^\u0622-\u064A]", ' ', tweet,…
0
votes
1 answer

SFrame Column of type: Dictionary

When I run: my_sframe['col_1'] = '' I get a blank column, which is what I wanted. However, when I run: my_sframe['col_1'] = {} I get an error that says unexpected data type. The SFrame API doesn't address this, as seen…
Arash Howaida
  • 2,575
  • 2
  • 19
  • 50