I have a dataframe called housing. One of the attributes of dataframe is "Ocean_proximity" which categorical attribute. I applied a condition of median_house_value to be 450k on a dataframe. Now here, I want to keep only one record on each category of "Ocean_proximity" and delete all other records.
I am using pandas and python3.0 '''
>>>housing[housing.median_house_value==450000][['median_income','median_house_value','ocean_proximity']]
>>>
median_income median_house_value ocean_proximity
993 6.1023 450000.0 INLAND
4265 1.7306 450000.0 <1H OCEAN
4623 0.8804 450000.0 <1H OCEAN
4676 5.8632 450000.0 <1H OCEAN
4685 3.6111 450000.0 <1H OCEAN
4717 2.7824 450000.0 <1H OCEAN
5427 2.2402 450000.0 <1H OCEAN
5506 3.6667 450000.0 <1H OCEAN
5890 4.0893 450000.0 <1H OCEAN
6555 7.7108 450000.0 INLAND
8314 2.1579 450000.0 ISLAND
8317 2.7361 450000.0 ISLAND
>>>housing
>>>
median_income median_house_value ocean_proximity
993 6.1023 450000.0 INLAND
4265 1.7306 450000.0 <1H OCEAN
8317 2.7361 450000.0 ISLAND