I have using Pandas for data processing before training a binary classifier. One of the things I could not find was a function that tells me given a value of a certain feature, let's say Age (people who are for example 60 years old) which percentage of this people are classified as 1 or as 0 (in the binary data column). And this for all different ages in the Age column.
A simple example to illustrate my idea. I have the following DataFrame:
import pandas as pd
data = pd.DataFrame({'Age': [23, 24, 23 ,25 ,24 ,24 ,20], 'label': [0, 1, 1, 0, 1, 1, 0]})
and I want a function that gives me the percentage of people from all ages that are labeled as 0 or as 1. Like so:
Age Percentage
0 20 0.0
1 23 0.5
2 24 1.0
3 25 0.0
Is there any function already implementing that? Because I could not find one and I find this a pretty common need for data analysis in binary classification problems.
Thank you!