How to import Google Analytics data in R

As a website owner, Google Analytics is one of the most important tools for me. It gives me statistics about my visitors and target group, which pages are interesting to people and how often someone returns. The online dashboard of Google Analytics shows you the most important metrics, which you can easily adapt yourself. But sometimes you want more sophisticated analyses, such as correlations and predictions. The online dashboard is not the right tool for this, but probably some statisticians or analysts can help you by using this tutorial.

 

Google Analytics features

Google Analytics has so many features, you can get lost easily. I will write another blog post about the main features Google Analytics offers, for now I will focus on how to set-up a connection between Google Analytics and R and how you can import and analyze data in R.

 

Let’s get started!

Before we can start analyzing our Google Analytics data in R, we have to set up a project first.

  1. Go to https://console.developers.google.com and search for Analytics API in the ‘Library’ section.Set up Google Analytics in R
  2. Click on ‘Create Project’.
  3. On the next page click on ‘Create’.
    Set up Google Analytics in R
  4. Now type your project name. For this post I use the name ‘Test-project’. Select your e-mail preferences and click ‘Create’.
    Set up Google Analytics in R
  5. After a few seconds, your project is set up. Now go to the ‘Credentials’ section and click on ‘oAuth consent screen’. Enter your website credentials and click ‘Save’.
  6. Go back to the ‘Credentials’ section, click to create new credentials and select ‘OAuth client ID’.Set up Google Analytics in R
  7. On the next screen we select the type of client. In order to get our Google Analytics data in R, we select ‘Other’ and give our client ID a name.
    Set up Google Analytics in R
  8. After giving your client ID a name, you see a page with your client ID and client secret. Keep this screen open to copy the credentials into R later.
  9. Now let’s go to the R environment. First, we need to install the package ‘RGoogleAnalytics’. Also we create two variable to store the client ID and client secret. Then we create a token based on the two variables.
  10. The last line of code asks you if you want to use a local file to cache OAuth access credentials between R sessions. When you type ‘No’, the console gives you a link which you put in your browser. In return, it gives you an authorization code you need to paste into R.
  11. You can validate your authorization code by using the function  validateToken()
  12. Now the fun part can start! The connection is all set-up, so we can extract the data we want to work with. First thing to do is to create a query including the dimensions and metrics you need in your dataframe. A list with all dimensions and metrics can be found here. You should also specify the start and end date, the maximum number of results to return and on which dimension you want to sort the output. Finally you should provide you table.id, this can be found in your account on analytics.google.com.Use the  QueryBuilder() function to create a list object and finally create your dataframe by using  GetReportData() .

So that’s it, your Google Analytics data is now in R! You can do whatever you want with it like any other dataframe in R.

World full of data author

Who I am


Hi! My name is Claudia, a freelance data analyst/scientist. This is my space on the internet where I share knowledge and experience with everyone who wants to become a better analyst. Read more about my work as a freelancer here.

Share this post on

Share this post on

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.