Questions tagged [pandas-profiling]

pandas-profiling is a Python package for exploratory data analysis. Use this tag when your question is specifically related to this package, in contrast to general exploratory data analysis.

150 questions
2
votes
1 answer

how to install pandas-profiling with markupsafe error

I am trying to install pandas-profiling, but I keep getting an error that markupsafe version 2.1.1 cannot be found. !pip3 install pandas-profiling >> ERROR: Could not find a version that satisfies the requirement markupsafe~=2.1.1 (from…
matt.aurelio
2
votes
1 answer

How to fix this error while using pandas profiling in jupyter notebook

Every time I use pandas profiling on a different dataset, the notebook shows me this error: IndexError: only integers, slices (:), ellipsis (...), numpy.newaxis (None) and integer or boolean arrays are valid indices. import pandas as pd df =…
2
votes
1 answer

Extract specific values from pandas profiling to a table

I'm using pandas profiling to make HTML reports of some dataframes with 150+ attributes. I'd like to extract some of the information and arrange it in a simple table. Specifically, I need the number of missing values in each variable, 'n_missing',…
GiveGet_15
2
votes
0 answers

Pandas Profiling-Interpret .html data to a readable table format using python, pyspark

I've used pandas profiling to display pandas dataframe data as .html in Databricks. Is there a way I can get this great .html report in table format? For further automation, I would need to transform the .html report…
chaitra k
2
votes
1 answer

Show "Other Values" in Pandas Profiling Report

I am creating a Pandas Profiling Report. Under "Variable" detailed information of each column is displayed. When I press "Toggle details" and navigate to "categories" the most common values appear ending with "Other values" which are about 50% in my…
liam beck
2
votes
1 answer

Integrating pandas-profiling report in dash app

How to integrate pandas-profiling report into a dash app? Pandas Profiling Streamlit allows these integrations (but I'm having a hard time managing cache/sessions in…
2
votes
0 answers

How to make an executable that includes pandas-profiling?

Based on this ticket https://github.com/pandas-profiling/pandas-profiling/issues/706 (suggesting to ask on stackoverflow), is there any way (maybe a modified spec file) to get a simple executable from pyinstaller that uses pandas profiling? For…
Joseph Doob
2
votes
1 answer

Unable to run Pandas Profiling on Databricks

I am trying to run Pandas Profiling on a sample data frame in a Databricks environment. I am getting an error related to matplotlib, and I am not sure whether the issue is with Matplotlib or pandas-profiling. Any help would be highly appreciated. Databricks…
Naga Budigam
2
votes
1 answer

How to run Profile Report from pandas_profiling?

I imported pandas, numpy and pandas profiling with the code below, but for some reason it displays only 3 lines of information and no data. Here is the code: from pandas_profiling import ProfileReport profile=ProfileReport(df, title='Pandas Profiling…
2
votes
1 answer

don't use index in pandas-profiling

When running pandas-profiling on a dataframe, I see that it analyses the index as a variable. Note: my index is a unique key (named UUID). Is there a way to exclude the index from the report? I understand I could remove it in pandas but in my head I…
Ray Bell
2
votes
1 answer

How can I get the numbers for the correlation matrix from Pandas Profiling

I really like the heatmap, but what I need are the numbers behind the heatmap (AKA correlation matrix). Is there an easy way to extract the numbers?
CHW
2
votes
1 answer

pandas_profiling TypeError when using HTML format

I followed the script from the pandas_profiling documentation, but this problem always arises. My dataset is the Boston dataset from sklearn. I get the report, but without the features of the HTML version: profile2 = ProfileReport(data, title="Relatório DATASET -data-",…
Douglas
2
votes
1 answer

Unable to run a selective pandas profiling for large dataset

I have a large dataset with 100 columns and 100000 rows, and I'm trying to run a pandas profile report, but it generates a very large HTML file (300MB) that I am unable to open in any browser. So I tried minimal=True but that just provides…
2
votes
1 answer

Pandas Profiling - KeyError: 'Requested level (var1) does not match index name (None)'

I am receiving KeyError: 'Requested level (var1) does not match index name (None)' when running the code below. pandas_profiling version = 2.8.0 Code: Profile = df_NYC.profile_report(title = "NYC Flight Data Profiling before…
Ash M
2
votes
0 answers

Extracting data from pandas_profiling description with dict

I have a dict that I want to extract data from after profiling with pandas-profiling. I am trying to get the data for gfcid. I checked the keys() and it returns 4 keys. dict_keys(['table', 'variables', 'freq',…
user2805507