Two challenges here to what I'm trying to accomplish.
- A DataFrame where the same company is listed for 2 consecutive rows. The first row associated with each company is related to Apple (iOS) and the second is for Android.
- I need to have the 'App Views' column represented as an int and then the other columns would be a % of the views. (so if there are 5000 app views the next column for Apple would be installs and I want to show the % of users who viewed the app, then installed it). For this I'll need several columns beyond instal but to keep it short I am just leaving it like this!
That's the first part of the challenge. For the 2nd part of the challenge:
- I really need to be able to make a big DataFrame full of fake data. Maybe Faker? The way the fake data needs to be populated would be with random values. So for each company I need a random number for Apple Views and then a 0 for Android, and in the next row a random number for Android views and a 0 for Apple. Then I'll need to take a % of those views and have randomized %'s for the next column.
The table is the result I am looking for:
( If this seems like a terrible idea to do in python and would be easier to do in excel somehow that's a great answer too just need someone can point me in the right direction if that is the case then I could then import a .CSV into a DataFrame! )
Company Name Apple App Views Apple Install Droid View DoidInstall
0 Zynga 5000 0.50 0.00 0.00
1 Zynga 0 0 15000 0.33
2 EA Mobile 22000 0.57 0.00 0.00
3 EA Mobile 0 0 26000 0.49