Questions tagged [pandas-profiling]

pandas-profiling is a Python package for exploratory data analysis. Use this tag when your question is specifically related to this package, in contrast to general exploratory data analysis.

150 questions
0
votes
1 answer

Does ydata-profiling works in a Spark Envirnoment?

I need to analyze a huge table with approx 7 millions lines and 20 columuns. I can read data in a dataframe without using Spark, but I can't have enough memory for computation. Does someone know if the package can work in distributed spark…
Simocrep
  • 3
  • 2
0
votes
0 answers

How to customize pandas profiling report to display the 90th percentile value?

I am using the pandas profiling library to generate reports for my DataFrame. However, I would like to customize the report to include the 90th percentile value in the statistics section. I tried modifying the…
TheDS
  • 101
  • 2
  • 11
0
votes
0 answers

Change plot size in pandas-profiling/ydata-profiling

How can I change pandas-profiling's plot size and font size (correlations: heatmap, missing values plots and heatmap) in Jupiter Notebook? import pandas as pd from ydata_profiling import ProfileReport profile_test = ProfileReport(test,…
0
votes
0 answers

Weighting Stats and Plots in Pandas Profiling

I have a column named "weight" in a pandas dataframe. Is there a method or way to use this column to weight all other columns when generating a pandas profiling report? I want to maintain the original values for sample count, min/max, etc. while…
craylor
  • 1
  • 1
0
votes
1 answer

missing columns for ydata-profiling correlation report

I'm using ydata-profiling (the evolution of pandas-profiling) to compute correlation among columns of large datasets (e.g. 400411 rows and 27 columns). These are configurations in config.yaml: correlations: pearson: calculate: false …
0
votes
1 answer

How to customize customize alerts + other metrics in pandas_profiling / y_data_profiling alerts

pandas_profiling, or as it is now called, y_data_profiling provides a detailed breakdown of data quality. How can we customize alerts + other metrics included in their default report? I see options to change color scheme, and to hide existing…
0
votes
2 answers

Ydata profiling correlations gives table only labeled "auto", which correlation is this and how to get all correlations?

I am analysing a dataset with a sample of 200 000. I am only getting a table labeled as auto for correlations. I am running ydata profiling in Jupyter with the following command "profile = ProfileReport(df, title = "Pandas Profiling…
kehlou
  • 1
0
votes
1 answer

pandas-profiling / ydata-profiling : not able disable some basic alerts like "Zeros"

I'm using ydata-profiling (pandas profiling) and I'm not able to disable some alerts (e.g. Zeros). Here, https://ydata-profiling.ydata.ai/docs/master/pages/advanced_usage/available_settings.html , I can find only some alerts which can be removed,…
user1403546
  • 1,680
  • 4
  • 22
  • 43
0
votes
1 answer

Compare Histograms side-by-side instead of single-page [ydata-profiling / pandas-profiling]

I would like to create a comparison report showing histograms side-by-side as in the example using the variable "Ferritin" mentioned on https://ydata.ai/resources/advanced-data-visualisation-with-pandas-profiling. Unfortunately, when I try to…
0
votes
1 answer

ydata-profiling: AttributeError: 'Figure' object has no attribute 'supylabel' on Google Colab

I'm having an issue while trying to run the comparison report on ydata-profiling. I was trying to follow this tutorial and I'm not sure what went wrong... import pandas as pd from pandas_profiling import ProfileReport df =…
0
votes
2 answers

Cannot import name 'to_html' from 'pandas_profiling.report' using JupyterLab

I'm new using Jupyter Lab and Pandas profiling. I'm trying to install and import and install Pandas Profiling in a jupyter notebook. I'm able to install pandas collab using pip, but unable to import the library. The error says I cannot import name…
0
votes
1 answer

Could not install pandas_profiling packages due to an OSError: [WinError 5] Access is denied

I tried installing pandas_profiling in Jupyter Notebook and got the following error !pip install pandas_profiling it throws an error I also tried this "Run pip install pandas-profiling command in a separate cell in the jupyter notebook. After this…
0
votes
0 answers

Why is pandas-profiling report not working in kaggle

I'm trying to run this code in kaggle for EDA. As you can see everything seems correct but I don't see an option to open an html file. How would I access the file that I just created?
BannyM
  • 212
  • 2
  • 8
0
votes
0 answers

Unable to do pandas profiling and generating report in notebook

This is the error which I always get. The vbox error comes in my both auto viz and profiling report. Please let me know if my notebook is outdated or where I am missing the part ? ** type of the return value must be…
0
votes
1 answer

How to import pandas profile report output as html/json file on AWS S3 location

I have a dataframe df, generating ProfileReport shown below: profile = pandas_profiling.ProfileReport( df, title=f"file_name Data Profile Report", minimal=True) after profiling writing the output to local file system in ec2 machine…
Farooque
  • 3,616
  • 2
  • 29
  • 41