I have a DataFrame, and one column is "lang" for "language."
Two different values in this column are "en" for "English" and "en-gb" for "British English."
There are numerous other values in this row, including "es" for "Spanish, "fr" for "French," and so on.
So it looks something like this:
user lang id
joe en 77788
jim en-gb 23323
pedro es 12134
tom en 53892
juan es 24434
phillippe fr 04211
george en-gb 99999
For the purposes of my analysis, I want to count the 'en' and 'en-gb' values together as being the same "en" or "English" value. Perhaps I could put just this column into a Series and then count them as one, or I could change the "en-gb" values with "en."