Questions tagged [graphlab]

GraphLab Create is a Python library, backed by a C++ engine, for quickly building large-scale, high-performance data products.

GraphLab Create is a Python library, backed by a C++ engine, for quickly building large-scale, high-performance data products. GraphLab Create has been renamed to Turi Create, more recent versions of which are open source.

Some key features of GraphLab Create are:

  • Analyze terabyte scale data at interactive speeds, on your desktop.
  • A Single platform for tabular data, graphs, text, and images.
  • State of the art machine learning algorithms including deep learning, boosted trees, and factorization machines.
  • Run the same code on your laptop or in a distributed system, using a Hadoop Yarn or EC2 cluster.
  • Focus on tasks or machine learning with the flexible API.
  • Easily deploy data products in the cloud using Predictive Services.
  • Visualize data for exploration and production monitoring.
223 questions
0
votes
2 answers

Error importing numpy & graphlab after installing ipython

I have got a strange issue. I am now using graphlab/numpy to develop a project via Pycharm 5. OS is Mac OS 10.11.5. I created a p2.7 virtual environment for the project. Programme runs well. But after I install ipython, I can no longer import…
Scott
  • 1
  • 1
0
votes
1 answer

Assign a matrix row to a single data frame column

I have a matrix tf.m NxM and and data frame df with N rows.I want to assing row n of the matrix to a column in the data frame, at the same row n. library("tm") ftfidf <- function(text.d) { txt <- VectorSource(text.d); txt.corpus <- VCorpus(txt,…
Stefano Piovesan
  • 1,185
  • 3
  • 19
  • 37
0
votes
0 answers

Graphlab SFrame constructor converting NumPy S1 dtype to int

Here is an example: import graphlab as gl import numpy.random as npr npr.choice(['a', 'b', 'c', 'd'], size = 100, replace=True).dtype df_so = gl.SFrame({"col1": npr.randint(low = 1, high = 3, size = 100), "col2": npr.randint(low…
tchakravarty
  • 10,736
  • 12
  • 72
  • 116
0
votes
1 answer

Graphlab - OverflowError: Long too big to convert

I am creating different polynomial regression models, by passing different powers of same teaching feature. So if I want a polynomial model of degree 3 of the feature 'x'. Then to the regression model, I am passing x^1,x^2 and x^3 as the…
Mustafa Khan
  • 397
  • 1
  • 5
  • 13
0
votes
1 answer

Import csv in Graph lab with particular datatype

In graphlab, I am facing the following problem: feat1 = gl.SFrame.read_csv(dir_path + '/data/' + 'file_1.csv') 'feat1' contains a column named 'movieId' with type 'int'. feat1.dtype 0 float 1 float 2 float 3 float 4 float 5 float 6 …
Yas
  • 811
  • 4
  • 11
  • 20
0
votes
1 answer

GraphLab .apply to sframe

I have trying to do a very simple .apply to an sarray in graph lab create and I am confused as to why it is not working. def myfunc (x): if(x == 0): x = -1 sa = SArray([0,0,0,1,1], dtype=int) sa.apply(myfunc, dtype = int) My output is [None,…
Hound
  • 932
  • 2
  • 17
  • 26
0
votes
1 answer

For my jupyter notebook in the dato-env key words are not highlighted and code does not seem to run. Given the errors I see, why is that?

Below are the errors I get on the terminal which the Dato Launcher starts. The Jupyter notebook does come up, but keywords are not highlighted and execute cell does not seem to do anything (for e.g. a print won't actually print anything). [E…
yalis
  • 1,508
  • 1
  • 16
  • 24
0
votes
1 answer

Graphlab Nearest neighbor method, not correct output

I am trying to find similarity between the new users and the existing ones(generated a trained model from recommend method of graphlab) from trained model. So, tried using nearest neighbor similarity method(query option) in graphalb to do in this…
0
votes
1 answer

How to use Graphlab recommend() for providing recommendations to new user?

In Graphlab, I am trying to use recommend() method, to see how it provides recommendation for a new user(user_id) which isn't present in the trained model prepared from give dataset. Since the aim is to determine similar users through this…
0
votes
1 answer

Calculate TF for tags on a movie database python/graphab

Given a number of movies with their associated tags (tags are keywords), how can I calculate the TF or TF-IDF vector for each movie? Are their ways of doing this automatically using a library in Graphlab or Python? Here is my input: print…
Yas
  • 811
  • 4
  • 11
  • 20
0
votes
1 answer

Grpahlab SFrames: Error in using SFrames with the dataset

In Graphlab, I am working with small set of fitness data, to use recommender functions that could provide recommendations. The dataset has userid's column but not item id's, instead different items arranged in columns and their respective ratings…
0
votes
1 answer

How to change an array to SFrame for GraphLab ItemSimilarityRecommend

I have written my custom pairwise similarity function in python which given a matrix of features X (contains rows of features), find and returns the output as k nearest neighbor to each item given a similarity metric: def…
Yas
  • 811
  • 4
  • 11
  • 20
0
votes
1 answer

Datetime in python - speed of calculations - big data

I want to find the difference (in days) between two columns in a dataframe (more specifically in the graphlab SFrame datastructure). I have tried to write a couple of functions to do this but I cannot seem to create a function that is fast enough.…
mkultra
  • 321
  • 3
  • 9
0
votes
1 answer

Creating sframe logistic classifier

I'm using graphlab-create to build a logistic classifier. My data is in an sframe (I've checked this using .dtype) This is for a Coursera class on machine learning using a zip file of Amazon reviews: products =…
0
votes
1 answer

Error installing graphlab

I tried installing graphlab using the pip command where we enter the email address and product key in the command. I tried upgrading it. I am using the sudo pip command and I am working on a Mac. I am getting this exception every time and it fails.…