The question I need answered
The most important thing in data analysis is the question you want answered. In my case, I want to see which countries have the most interest in the SAP SuccessFactors Add-On. I suspect the US and Germany will be at the top, but I have no idea which other countries show the most interest. Let us find out.
These are the steps, with the R code for each.
Step 1: Read the data file which is available to me as a CSV file.
downloads <- read.csv("downloads.csv")
This reads the raw data I got from the database into a data frame called downloads.
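Before going further, it can help to check what was actually read. The sketch below uses a few made-up rows in place of downloads.csv, since the real file is not shown here. It assumes the CSV header is "Country Name" (with a space) and that a "Download Date" column exists; both are assumptions for illustration.

```r
# Hypothetical sample standing in for downloads.csv; the rows are made up.
csv_text <- "Country Name,Download Date
United States,2014-01-05
Germany,2014-01-06
United States,2014-01-07"

# With the default check.names = TRUE, read.csv turns the header
# "Country Name" into the syntactic column name Country.Name.
sample_downloads <- read.csv(text = csv_text)

str(sample_downloads)    # column names and types
head(sample_downloads)   # first few rows
```

If the column names printed by str() differ from what you expect, that is the place to catch it, before the counting step.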
Step 2: Select just the data I need.
The downloads data frame has multiple rows with the same country name. I want to find out how many times each country name is listed. I can use the table function to find that out.
countrycount <- as.data.frame(table(downloads$Country.Name))
The above code creates a data frame with the country names in one column (named Var1 by default) and the number of times each occurs in the original data in a second column named Freq.
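A small sketch of what the counting step produces, using made-up country values rather than the real download data. Renaming the first column to Country is optional and is my own choice of name, not something the original data requires.

```r
# Made-up country values for illustration only.
sample_downloads <- data.frame(
  Country.Name = c("United States", "Germany", "United States", "Australia")
)

# table() counts how often each value occurs; as.data.frame() turns
# the counts into a two-column data frame.
sample_count <- as.data.frame(table(sample_downloads$Country.Name))
names(sample_count)   # the default column names are Var1 and Freq

# Optional: give the first column a more descriptive (hypothetical) name.
names(sample_count)[1] <- "Country"
print(sample_count)
```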
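Step 3: Sort the data so that the country with the most downloads is at the top.
This is the R code to do that. It reorders the rows of the whole data frame, so each country stays next to its count.
countrycount <- countrycount[order(countrycount$Freq, decreasing = TRUE), ]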
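To keep each country name together with its count while sorting, one common approach is to reorder the rows with order() and then look at the first few with head(). The sketch below uses made-up counts, not the real download numbers.

```r
# Made-up counts for illustration only.
sample_count <- data.frame(
  Var1 = c("Australia", "Germany", "United States"),
  Freq = c(3, 5, 9)
)

# order() returns row positions, so indexing with it reorders whole rows.
sorted_count <- sample_count[order(sample_count$Freq, decreasing = TRUE), ]

# Show the two countries with the highest counts.
head(sorted_count, 2)
```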
Step 4: The last step is to write the data to a CSV file so that I can share it with other people and systems. R's write.csv function does this.
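A minimal sketch of this step, using write.csv from base R. The output file name countrycount.csv is a hypothetical choice, and the small data frame stands in for the sorted countrycount with made-up numbers.

```r
# Made-up counts standing in for the sorted countrycount data frame.
sample_count <- data.frame(
  Var1 = c("United States", "Germany"),
  Freq = c(9, 5)
)

# Hypothetical output file name. row.names = FALSE leaves out the
# extra column of row numbers that write.csv adds by default.
out_file <- "countrycount.csv"
write.csv(sample_count, out_file, row.names = FALSE)
```

The file lands in the current working directory, which getwd() will show.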
Now let us look at the data.
Customers from the US downloaded the Add-On the most. No surprise there. We have thousands of SAP ERP HCM customers in the US. The second is Germany. No surprise there either. Third is Australia. Fourth is Saudi Arabia. That is good to know: there is a lot of interest in Australia and Saudi Arabia. There is also a lot of interest from many European customers in the Talent Hybrid model. This information gives me and my product management colleagues enough insight to make some data-driven decisions. Of course, I have a lot more data than this and can find answers to many more such questions.
The next step could be to visualize this data to convey the information quickly. Many tools, including R, can do that. I will get to it in the future.
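As a rough idea of what that could look like, here is a sketch of a bar chart using barplot from base R. The counts are made up for illustration and are not the real download numbers.

```r
# Made-up counts for illustration only.
sample_count <- data.frame(
  Var1 = c("United States", "Germany", "Australia"),
  Freq = c(9, 5, 3)
)

# One bar per country, labelled with the country name.
bar_mid <- barplot(
  sample_count$Freq,
  names.arg = as.character(sample_count$Var1),
  ylab = "Downloads",
  main = "Downloads by country (sample data)"
)
```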
|United Arab Em.||32|