Questions tagged [blaze]

Blaze is a NumPy/Pandas like interface to data analytics developed by Continuum Analytics.

Blaze is intended to provide an expressive, compact set of foundational abstractions for composing computations over large amounts of semi-structured data.

81 questions
3
votes
1 answer

Efficient way to find particular rows with Blaze package?

I have a data table that has ~74 million lines that I used blaze to load it. from blaze import CSV, data csv = CSV('train.csv') t = data(csv) It has fields these: A, B, C, D, E, F, G Since this is such a large dataframe, how can I efficiently…
3
votes
1 answer

How do I read tabulator separated CSV in blaze?

I have a "CSV" data file with the following format (well, it's rather a TSV): event pdg x y z t px py pz ekin 3383 11 -161.515 5.01938e-05 -0.000187112 0.195413 0.664065 0.126078 -0.736968 0.00723234 1694 11 …
honza_p
  • 2,073
  • 1
  • 23
  • 37
3
votes
0 answers

Blaze can not tackle with BIGINT, TEXT data structure of postgresql database

I am trying Blaze to analyze data in my postgres db connecting use conn_string = 'postgresql://postgres:mysecretpassword@localhost:5432/postgres' Data(resource(conn_string, schema='public')) it gives the error Blaze does not understand a SQLAlchemy…
Hello lad
  • 17,344
  • 46
  • 127
  • 200
3
votes
1 answer

correlation between columns python blaze

Got a simple question about how-to use python blaze module for analysis. So, i'm trying to do this code: from blaze import SQL,Table from sqlalchemy import create_engine from scipy.stats import…
renardeinside
  • 377
  • 1
  • 9
3
votes
1 answer

How do you install the blaze module (Continuum analytics) in Python?

How do you install blaze natively (i.e., not in a virtual environment) in Python? The only instructions I find are on in the package's doc (see link), and here, in a virtual environment.
BenDundee
  • 4,389
  • 3
  • 28
  • 34
2
votes
0 answers

Is there a shutdown hook for http4s / blaze / cats-effect

I am running a http4s server using a blaze backend with cats-effect IO effects. I have an event logger that collects events and periodically flushes them to persistent storage. I'm wanting to ensure that any events in the queue are flushed before…
Joseph Thomas-Kerr
  • 339
  • 1
  • 4
  • 14
2
votes
0 answers

blaze-odo error when trying to export an odo result to csv with MSSQL backend

I'm having an issue exporting some data from a MSSQL back-end to csv using blaze and odo. The first thing that I do is create a blaze.data object that connects to the MSSQL db using a dns connection and pyodbc as the driver. adjustment_statuses is a…
jgens
  • 21
  • 2
2
votes
1 answer

dask.bag processing data out-of-memory

I'm trying to use dask bag for wordcount 30GB of json files, I strict according to the tutoral from offical web: http://dask.pydata.org/en/latest/examples/bag-word-count-hdfs.html But still not work, my single machine is 32GB memory and 8 cores…
SharpLu
  • 1,136
  • 2
  • 12
  • 28
2
votes
1 answer

Blaze Data field map throws TypeError

I have recently started moving my data exploration code set from pandas to blaze. I am running into the following issue. Assume: from blaze import * s = Data([(1, 'Alice', 100), ... (2, 'Bob', -200), ... (3, 'Charlie',…
tipanverella
  • 3,477
  • 3
  • 25
  • 41
2
votes
1 answer

get only numeric columns in python blaze object

I have a blaze data object like this import blaze as bz bdata = bz.Data([(1, 'Alice', 100.9), (2, 'Bob', 200.6), (3, 'Charlie', 300.45), (5, 'Edith', 400)], fields=['id', 'name', 'amount']) bdata | id |…
Kathirmani Sukumar
  • 10,445
  • 5
  • 33
  • 34
2
votes
1 answer

Unsupported string encoding using Blaze Data(engine)

Is there a workaround for this? I have a MSSQL table that uses collation settings of SQL_Latin1_General_CP1_CI_AS. engine = create_engine('mssql+pyodbc://'+ServerName+'/'+Database+'?driver='+ServerDriver+'?charset=cp2150') I am able to load it into…
BernardL
  • 5,162
  • 7
  • 28
  • 47
2
votes
2 answers

Build sqlalchemy query from string

How can I convert a sql where clause string to a sqlalchemy query? I'm assuming I already know the table. I'm building an Angular webapp which hits a Flask API for data. Flask is using sqlalchemy to query the db. jQuery-QueryBuilder fromSQL …
postelrich
  • 3,274
  • 5
  • 38
  • 65
2
votes
1 answer

Appending bcolz columns with Blaze

Let's first construct a ctable: import pandas as pd import blaze as bl df = pd.DataFrame({'x': range(4), 'y': [2., 4., 2., 4.]}) bl.odo(df, 'test.bcolz') Now suppose I wanna add a column called 'x_mod' to this table. I tried test_table =…
cd98
  • 3,442
  • 2
  • 35
  • 51
2
votes
1 answer

Maintain data columns when converting pandas hdfstore with odo

I'm using odo from the blaze project to merge multiple pandas hdfstore tables following the suggestion in this question: Concatenate two big pandas.HDFStore HDF5 files The stores have identical columns and non-overlapping indicies by design and a…
Kyle
  • 2,814
  • 2
  • 17
  • 30