Questions tagged [intake]

The python data access and cataloguing project "Intake", https://intake.readthedocs.io/en/latest/

25 questions
1
vote
1 answer

Intake Cache specify filename/location

I'm trying to use intake and the intake-xarray to open and store remote files. I have a minimized catalog file here: /isibhv/projects/paleo_pool/boundary_conditions/ice_sheet_reconstructions/ice_sheet_reconstructions.yaml It looks like…
pgierz
  • 674
  • 3
  • 7
  • 14
1
vote
1 answer

Moving data from a database to Azure blob storage

I'm able to use dask.dataframe.read_sql_table to read the data e.g. df = dd.read_sql_table(table='TABLE', uri=uri, index_col='field', npartitions=N) What would be the next (best) steps to saving it as a parquet file in Azure blob storage? From my…
Ray Bell
  • 1,508
  • 4
  • 18
  • 45
1
vote
1 answer

Persisting only part of a data source

I'm using intake to access the catalog catalog.ocean.GFDL_CM2_6.GFDL_CM2_6_control_ocean_surface. At the moment I only work with small patches of that data, but accessing that data every single time is still quite costly (it's on Google Cloud…
ArthurPGB
  • 43
  • 4
0
votes
0 answers

How to filter a catalog by date when importing intake in Python?

I have the following code import intake import pandas as pd # Importing json file catalog_file = "../my/path/path.json" #using `intake` to manage the data cat = intake.open_esm_datastore(catalog_file) # Furthering filtering the data hits =…
jei L
  • 33
  • 6
0
votes
1 answer

Inake-xarray: AttributeError: 'list' object has no attribute 'startswith'

I try to run this code, which ought to work... import xarray as xr import intake import hvplot.xarray import dask catalog_url =…
Jonathan Roy
  • 405
  • 1
  • 6
  • 18
0
votes
1 answer

Hiding secrets in intake catalog for remote access (S3/MinIO)

I'm trying to build an intake catalog for my team. The datasets are on a shared MinIO server for which each user should have their own service account, and therefore a key/secret pair. When creating the first catalog entry like this: source =…
Adrian
  • 755
  • 9
  • 17
0
votes
2 answers

Handle environmental variables in config options

I have snakemake command line with configuration options like this: snakemake --config \ f1=$PWD/file1.txt \ f2=$PWD/file2.txt \ f3=/path/to/file3.txt \ ...more key-value pairs \ --directory /path/to/output/dir file1.txt and…
dariober
  • 8,240
  • 3
  • 30
  • 47
0
votes
2 answers

Relative paths in config.yaml for Snakefile

How can I use relative paths in my configuration file so that users do not need to change USER in the paths for output directories? I have this: config.yml proj_name: H1N1_rhesus contact: email: user.edu person: user 01-preprocess:…
Noah_Seagull
  • 337
  • 5
  • 18
0
votes
1 answer

Add parameters to Python Intake LocalCatalogEntry

I'm trying to build a LocalCatalogEntry for Python's Intake package (as part of a larger catalog, which might have multiple entries, one of which I'm trying to create here). However, I can't seem to figure out how to feed it user parameters to…
Jessica
  • 505
  • 1
  • 3
  • 11
0
votes
1 answer

'import intake' in google colab generates ContextualVersionConflict

!pip install intake-esm install the latest version of intake-esm (2020.12.18) in google colab but the import intake statement generates the following error: ContextualVersionConflict: (requests 2.23.0 (/usr/local/lib/python3.6/dist-packages),…
1
2