pandas-profiling is a Python package for exploratory data analysis. Use this tag when your question is specifically related to this package, in contrast to general exploratory data analysis.
Questions tagged [pandas-profiling]
150 questions
0
votes
1 answer
Does ydata-profiling works in a Spark Envirnoment?
I need to analyze a huge table with approx 7 millions lines and 20 columuns.
I can read data in a dataframe without using Spark, but I can't have enough memory for computation.
Does someone know if the package can work in distributed spark…

Simocrep
- 3
- 2
0
votes
0 answers
How to customize pandas profiling report to display the 90th percentile value?
I am using the pandas profiling library to generate reports for my DataFrame. However, I would like to customize the report to include the 90th percentile value in the statistics section. I tried modifying the…

TheDS
- 101
- 2
- 11
0
votes
0 answers
Change plot size in pandas-profiling/ydata-profiling
How can I change pandas-profiling's plot size and font size (correlations: heatmap, missing values plots and heatmap) in Jupiter Notebook?
import pandas as pd
from ydata_profiling import ProfileReport
profile_test = ProfileReport(test,…

Dilmurat Basitov
- 1
- 1
0
votes
0 answers
Weighting Stats and Plots in Pandas Profiling
I have a column named "weight" in a pandas dataframe. Is there a method or way to use this column to weight all other columns when generating a pandas profiling report? I want to maintain the original values for sample count, min/max, etc. while…

craylor
- 1
- 1
0
votes
1 answer
missing columns for ydata-profiling correlation report
I'm using ydata-profiling (the evolution of pandas-profiling) to compute correlation among columns of large datasets (e.g. 400411 rows and 27 columns).
These are configurations in config.yaml:
correlations:
pearson:
calculate: false
…

Simocrep
- 3
- 2
0
votes
1 answer
How to customize customize alerts + other metrics in pandas_profiling / y_data_profiling alerts
pandas_profiling, or as it is now called, y_data_profiling provides a detailed breakdown of data quality.
How can we customize alerts + other metrics included in their default report?
I see options to change color scheme, and to hide existing…

wantering_otter
- 1
- 1
0
votes
2 answers
Ydata profiling correlations gives table only labeled "auto", which correlation is this and how to get all correlations?
I am analysing a dataset with a sample of 200 000. I am only getting a table labeled as auto for correlations.
I am running ydata profiling in Jupyter with the following command
"profile = ProfileReport(df, title = "Pandas Profiling…

kehlou
- 1
0
votes
1 answer
pandas-profiling / ydata-profiling : not able disable some basic alerts like "Zeros"
I'm using ydata-profiling (pandas profiling) and I'm not able to disable some alerts (e.g. Zeros).
Here, https://ydata-profiling.ydata.ai/docs/master/pages/advanced_usage/available_settings.html , I can find only some alerts which can be removed,…

user1403546
- 1,680
- 4
- 22
- 43
0
votes
1 answer
Compare Histograms side-by-side instead of single-page [ydata-profiling / pandas-profiling]
I would like to create a comparison report showing histograms side-by-side as in the example using the variable "Ferritin" mentioned on https://ydata.ai/resources/advanced-data-visualisation-with-pandas-profiling.
Unfortunately, when I try to…

dominik-bursy
- 1
- 1
0
votes
1 answer
ydata-profiling: AttributeError: 'Figure' object has no attribute 'supylabel' on Google Colab
I'm having an issue while trying to run the comparison report on ydata-profiling. I was trying to follow this tutorial and I'm not sure what went wrong...
import pandas as pd
from pandas_profiling import ProfileReport
df =…

SeaEngineering
- 36
- 2
0
votes
2 answers
Cannot import name 'to_html' from 'pandas_profiling.report' using JupyterLab
I'm new using Jupyter Lab and Pandas profiling.
I'm trying to install and import and install Pandas Profiling in a jupyter notebook. I'm able to install pandas collab using pip, but unable to import the library. The error says I cannot import name…
0
votes
1 answer
Could not install pandas_profiling packages due to an OSError: [WinError 5] Access is denied
I tried installing pandas_profiling in Jupyter Notebook and got the following error
!pip install pandas_profiling
it throws an error
I also tried this
"Run pip install pandas-profiling command in a separate cell in the jupyter notebook.
After this…

Ankit Thummar
- 1
- 4
0
votes
0 answers
Why is pandas-profiling report not working in kaggle
I'm trying to run this code in kaggle for EDA. As you can see everything seems correct but I don't see an option to open an html file. How would I access the file that I just created?

BannyM
- 212
- 2
- 8
0
votes
0 answers
Unable to do pandas profiling and generating report in notebook
This is the error which I always get. The vbox error comes in my both auto viz and profiling report. Please let me know if my notebook is outdated or where I am missing the part ?
**
type of the return value must be…

Priyanka Kachroo
- 1
- 1
0
votes
1 answer
How to import pandas profile report output as html/json file on AWS S3 location
I have a dataframe df, generating ProfileReport shown below:
profile = pandas_profiling.ProfileReport(
df, title=f"file_name Data Profile Report", minimal=True)
after profiling writing the output to local file system in ec2 machine…

Farooque
- 3,616
- 2
- 29
- 41