Questions tagged [pandas-profiling]

pandas-profiling is a Python package for exploratory data analysis. Use this tag when your question is specifically related to this package, in contrast to general exploratory data analysis.

150 questions
1 vote, 0 answers

Error in AWS Glue: ImportError: cannot import name '_imaging' from 'PIL'

I have been trying to use pandas-profiling in AWS Glue. One of its dependencies is Pillow. Since these packages are not available in AWS Glue by default, I packaged them into a zip file and uploaded it. After that I ran the line from…
1 vote, 2 answers

Using Pandas-Profiling in AWS Glue

I am trying to use pandas-profiling in AWS Glue. I downloaded the wheel file and added it to the Glue library path. But whenever I try to run pandas-profiling, a missing-module error comes up (for modules like multimethod, visions, networkx, pillow and…
1 vote, 0 answers

pandas-profiling aws glue to s3

I am trying to upload the HTML output of pandas-profiling to S3 from my Glue job, but I am getting empty files. It's an issue with saving the in-memory file to S3. I tried the following solution, but no luck: Saving HTML in memory to S3 AWS Python…
pdangelo4
1 vote, 1 answer

Pandas-Profiling.to_widgets(): Error displaying widget: model not found

I've been facing an intermittent issue with the pandas-profiling widget not rendering, and it has been going on and off for a while. I've tried this in the command prompt: jupyter nbextension enable --py widgetsnbextension. It comes up…
Kayden
1 vote, 1 answer

Is it possible to get a detailed list of word frequencies from Pandas Profiling?

I'm currently working with a large batch of files that require me to check the frequencies of certain strings. My first idea was to import all files into a single dataset and use a for loop to check all files for the strings using the following…
1 vote, 1 answer

import Pandas_Profiling command results in ValueError: invalid literal for int() with base 10: '0rc0'

I successfully installed the pandas-profiling module from the command prompt (as admin) using: pip install pandas-profiling. I tried importing pandas-profiling two ways, with the same result: method 1) import pandas_profiling; method 2) from…
1 vote, 1 answer

MemoryError when using pandas_profiling profile_report

I'm trying to profile an Excel file. It is a very small data set, only 30 columns and 535 rows, but when I run the profile_report function it stops at a different percentage each time but always has the same…
1 vote, 1 answer

Customise plots in pandas-profiling (change colors/colorbar in correlation heatmap/..)

I have searched the pandas-profiling documentation and its advanced usage page, but I could not find how to customise the plots that it generates. E.g., it currently shows correlation matrices with a colorbar where blue is 1 and red is -1. What if I want to…
My Work
1 vote, 0 answers

Column type problems in pandas-profiling

I'm using pandas-profiling on a dataset that contains 140 columns. This is my code: df = pd.read_parquet("ConsultationForm.parquet", columns=sorted(columnNames), engine="pyarrow") df = df.fillna(value=np.nan) df.head() profile = ProfileReport(df,…
Reco Jhonatan
1 vote, 1 answer

Hide the interactions section in the pandas-profiling HTML report

I have already tried hiding the correlations, missing_values and sample values. Now I'm trying to hide the interactions section, but I am not able to find the setting for it.
vijay
1 vote, 0 answers

pandas-profiling "Duplicate rows" section is not showing up in the HTML report

I am using pandas-profiling 2.8.0 and have generated an HTML report in which 2 duplicates are shown in the Overview section, as seen below. However, the "Duplicate rows" option/section is missing from my HTML report header. But in the shared example on…
PraveenS
1 vote, 2 answers

Spark dataframe to pandas profiling

I am trying to profile data with the pandas-profiling library. I am fetching the data directly from Hive. This is the error I am receiving: Py4JJavaError: An error occurred while calling o114.collectToPython. : org.apache.spark.SparkException:…
user2805507
1 vote, 0 answers

AttributeError: module 'matplotlib.cbook' has no attribute '_get_data_path'

I am trying to import two libraries in Jupyter. I am running the following code: import pandas as pd; from pandas_profiling import ProfileReport. This returns the following error: AttributeError Traceback (most…
1 vote, 0 answers

After installing pandas-profiling, I tried to import it and ran into a problem

I installed pandas-profiling with pip, and the installation succeeded. But when importing it with from pandas_profiling import ProfileReport, I ran into this problem: ValueError: bad marshal data (unknown type code)
1 vote, 1 answer

Getting an error while installing `pandas-profiling`

I want to install pandas-profiling on Python 3.8.2, but when I try to install the package with pip install pandas-profiling, I get this error: error: Microsoft Visual C++ 14.0 is required. Get it with "Microsoft Visual…
Sajjan Karn