I am doing a research project which wants to work out Hong Kong people's attitudes toward the public transportation system. I have collected millions of tweets using Twitter API. Since my research target is local Hong Kong people, tweets posted from official accounts or tweets contain advertisements may be useless.
Hence, could anyone give some hint about how to filter out the tweets posted from the official account? I know my explanation might be a little bit abstract. Any tips would be appreciated! Thank you!