Questions tagged [pandas-profiling]

pandas-profiling is a Python package for exploratory data analysis. Use this tag when your question is specifically related to this package, in contrast to general exploratory data analysis.

150 questions
0
votes
0 answers

Pandas Profiling cannot be imported into jupyter lab

When i try to import pandas profiling into jupyter lab this error message appears: ImportError: cannot import name 'DataError' from 'pandas.core.base' (C:\Users\Basti\AppData\Local\Programs\Python\Python311\Lib\site-packages\pandas\core\base.py) My…
0
votes
1 answer

from pandas_profiling import profilereport Error

I've installed the pandas_profiling package but, the system is not recognizing the package. I'm using VS Code for the project. Hi, I'm trying to install pandas profiling for an ML testing project I've checked multiple Question Related to it on…
0
votes
2 answers

Detecting similar columns across multiple files based on statistical profile

I'm attempting to clean up a set of old files that contain sensor data measurements. Many of the files don't have headers, and the format (column ordering, etc.) is inconsistent. I'm thinking the best that I can do in these cases is to match…
Ryan Gross
  • 6,423
  • 2
  • 32
  • 44
0
votes
0 answers

Pandas profiling html report too big to render

With 450 rows and 70 columns in my data frame, I produced a pandas profiling html report. Since the HTML file is 239MB in size, neither the Safari nor Chrome browsers allow me to see that report. I tried the "minimum =True" option, but in that case…
0
votes
1 answer

When trying to create html report the program throws error in

When executing the below profile = ProfileReport(df,title="Data Profile Report") profile.to_file("data_profile_report.html") Here is the exception thrown --------------------------------------------------------------------------- …
AllenFeng
  • 11
  • 2
0
votes
1 answer

How to save a .html file to S3 in a Sagemaker processing container

I am working in Sagemaker using python trying to profile a dataframe that is saved in a S3 bucket with pandas profiling. The data is very large so instead of spinning up a large EC2 instance, I am using a SKLearn processor. Everything runs fine but…
0
votes
0 answers

Pandas Profiling: IndexError: only integers, slices (`:`), ellipsis (`...`), numpy.newaxis (`None`) and integer or boolean arrays are valid indices

Screenshot showing the error: IndexError: only integers, slices (:), ellipsis (...), numpy.newaxis (None) and integer or boolean arrays are valid indices
0
votes
1 answer

Unable to import pandas profiling in kaggle noteook

I am working in kaggle notebook and wanted to use pandas profiling for EDA but it is giving error. What can be done to use this library in kaggle notebooks?
Ankit Seth
  • 729
  • 1
  • 9
  • 23
0
votes
1 answer

Error installing pandas_profiling in spyder

Can't install pandas_profiling in spyder Please if you can then tell how to install pandas profiling in spyder Also using only Spyder, Spyder IDE Tried these: pip install pandas-profiling pip install pandas_profiling==2.5.0 pip install…
Tiya
  • 1
  • 2
0
votes
1 answer

pandas profiling with dask-dataframe. IndexError

I get an IndexError (IndexError: only integers, slices (:), ellipsis, nmpy.newaxis and integer or bolean arays are valid indices) while pandas profiling with dask. data: 290170 x 55 import dask.dataframe as dd from pandas_profiling import…
0
votes
1 answer

installing pandas-profiling error in python 3.10

I am trying to install pandas-profiling in python 3.10 using pychan installation package option. It is giving error. I tried the same using command prompt. Giviing the same error. Few last lines of error are as…
0
votes
1 answer

Wheel building failed for phik, while installing pandas-profiling

I tried appending my issue to a pre-existing query on the same topic opened up yesterday, but I was not able to comment, and thus ended up making a new question. I tried installing wheel and phik individually, but kept getting the same error, as…
Giri Nair
  • 1
  • 1
0
votes
1 answer

only generate a part of pandas profiling report

I would like to use pandas profiling to generate some kind of summary to look at on a regular basis. I don't need the whole report though, I'm just interested in the alerts or warnings section. So I was wondering if there is a way to only output…
corianne1234
  • 634
  • 9
  • 23
0
votes
0 answers

No module named 'pandas-profiling' error in Rstudio

title: "Python in R" output: html_document version: RStudio 2021.09.0 Build 351 "Ghost Orchid" OS: macOS Monterey 12.0.1 knitr::opts_chunk$set(echo = TRUE) library (reticulate) repl_python() reticulate::py_config() pd <- import("pandas") pp<-…
pavanv
  • 13
  • 4
0
votes
0 answers

How to create multiple pandas profiling reports for multiple csv files in a directory? The report name should match the file name

I tried this, import glob import os import pandas as pd import pandas_profiling from pandas_profiling import ProfileReport files = glob.glob("D:\home_health_services_current_data\*.csv") df = pd.DataFrame() for f in files: csv =…