pandas-profiling is a Python package for exploratory data analysis. Use this tag when your question is specifically related to this package, in contrast to general exploratory data analysis.
Questions tagged [pandas-profiling]
150 questions
1
vote
0 answers
Error in aws Glue ImportError: cannot import name '_imaging' from 'PIL'
I have been trying to implement pandas profiling in aws glue. One of the dependencies it uses is pillow. Since these are not there in aws glue by default I packaged them into a zip file and uploaded it.
After that I ran the line
from…

Calvin Wilson
- 31
- 1
- 5
1
vote
2 answers
Using Pandas-Profiling in AWS Glue
I am trying to use pandas profiling in AWS Glue. I downloaded the wheel file and used it in the Glue Library Path. BUt whenever I am trying to run a pandas profiling, module missing error is coming up(like multimethod, visions, networkx, pillow and…

Ashish Pasupureddi
- 13
- 2
1
vote
0 answers
pandas-profiling aws glue to s3
I am trying to load the html output of pandas-profiling to s3 from my glue job but am getting empty files. Its an issue with saving the file in memory to s3. I tried the following solution but no luck Saving HTML in memory to S3 AWS Python…

pdangelo4
- 220
- 4
- 17
1
vote
1 answer
Pandas-Profiling.to_widgets(): Error displaying widget: model not found
Error screenshot
I've been facing an intermittent issue with pandas profiling widget not rendering & it has been going on and off for awhile.
I've tried this in the command prompt:
jupyter nbextension enable --py widgetsnbextension
it comes up…

Kayden
- 13
- 5
1
vote
1 answer
Is it possible to get a detailed list of word frequencies from Pandas Profiling?
I'm currently working with a large batch of files that require me to check the frequencies of certain strings. My first idea was to import all files into a single dataset and use a for loop to check all files for the strings using the following…

Daniel Coben
- 15
- 5
1
vote
1 answer
import Pandas_Profiling command results in ValueError: invalid literal for int() with base 10: '0rc0'
I successfully installed Pandas profiling module using:
pip install pandas-profiling
from the command prompt (as admin)
I tried importing pandas profiling two ways with the same result:
method 1) import pandas_profiling;
method 2) from…

Gregory Whittle
- 11
- 1
1
vote
1 answer
MemoryError when using pandas_profiling profile_report
I'm trying to profile an excel file, it is a very small data set, only 30 columns and 535 rows, but when I run the profile_report function it stops each time in a different percentage but always has the same…

Roberto Suárez
- 51
- 3
1
vote
1 answer
Customise plots in pandas-profiling (change colors/colorbar in correlation heatmap/..)
I have searched pandas profiling documentation and their advanced usage webpage but I could not find how to customise plots that it generates. Eg. now it shows correlation matrices with colorbar having blue as 1 and red as -1. What if I want to…

My Work
- 2,143
- 2
- 19
- 47
1
vote
0 answers
Column types problems in Pandas-profiling
I'm Using pandas-profiling in a dataset that contains 140 columns.
This is my code:
df = pd.read_parquet("ConsultationForm.parquet", columns= sorted(columnNames),engine="pyarrow")
df = df.fillna(value=np.nan)
df.head()
profile = ProfileReport(df,…

Reco Jhonatan
- 1,503
- 4
- 23
- 35
1
vote
1 answer
Hide the interactions section in pandas profiling report in html
I already tried hiding the correlations, missing_values and sample values. Now i'm trying to hide the interactions but i am not able to find the logic.

vijay
- 23
- 3
1
vote
0 answers
pandas-profiling "Duplicate rows" section is not showing-up in the HTML Report
I am using pandas-profiling=2.8.0 and I have generated an HTML report in which 2 duplicates are shown in the Overview Section, as seen below
But the "Duplicate rows" option/section is missing in my HTML Report header.
But in the shared example on…

PraveenS
- 115
- 13
1
vote
2 answers
Spark dataframe to pandas profiling
I am trying to achieve a data profiling with pandas-profiling library. i am fetching data directly from hive. this is the error i am receiving
Py4JJavaError: An error occurred while calling o114.collectToPython.
: org.apache.spark.SparkException:…
user2805507
1
vote
0 answers
AttributeError: module 'matplotlib.cbook' has no attribute '_get_data_path'
When trying to run 2 libraries on jupyter. I am running the following code:
import pandas as pd
from pandas_profiling import ProfileReport
This is returning the following error:
AttributeError Traceback (most…

Bhavantik Gondaliya
- 11
- 1
1
vote
0 answers
After installing pandas-profiling, I tried to import it and ran into a problem
I installed pandas-profiling using pip command and successfully installed it.
But, when importing it,
from pandas_profiling import ProfileReport
I ran into the problem,
ValueError: bad marshal data (unknown type code)

Himanshu Gupta
- 11
- 1
1
vote
1 answer
Getting error while installing `pandas-profiling`
Need Help I want to install pandas-profiling in python 3.8.2 but when i try to install the package i am facing errors.
pip install pandas-profiling
I am getting this error
error: Microsoft Visual C++ 14.0 is required. Get it with "Microsoft Visual…

Sajjan Karn
- 50
- 1
- 7