Familiarize yourself with the Metabase tool and the sample x-rays. You can find the documentation at https://www.metabase.com/learn/.
Add our two SQLite files as separate datasets. You can also upload CSVs to Metabase to the Sample Database.
Create a stacked bar chart (a bar for each type of domestic accommodation) over time. You can find documentation at https://www.metabase.com/learn/metabase-basics/querying-and-dashboards/visualization/bar-charts. It's fine to follow their example to learn about the bar chart and then translate your learning into your own bar chart. Did you notice the data problem for clinics/centers and clinics?
Now that you are familiar with the basics of Metabase how about you create some new chart of your own using the data. Remember you can select, filter, aggregate, and group the data prior to visualizing it. Present your created chart in class.
Task
Create a chart showing the top foreign residences of people visiting Berlin. It could look something like the chart below
When do the two regularly occurring peaks happen over the year for visitors from the US? Can you make out differences within these two peaks?
Task
It is possible to predict the number of future visitors from historical data. We will do this starting tomorrow. What are downsides to predicting the number of visitors this way? What are the implicitly made assumptions?
What are indicators predictive of visitors? Which could be publicly accessed and downloaded and help us predicting US visitors?
Task
With Google Trends you can get data from Google showing what people search on Google depending on certain time spans and geographic locations. Play with Google Trends. Can you find search terms with a similar pattern than our Berlin visitors from the US?
What is the value that reported by Google to indicate the volume of searches (the y-axis on the Google Trends charts). Research the definition and draw conclusions when comparing different search terms using Google Trends.
"Indexing: Google Trends data is pulled from a random, unbiased sample of Google searches, which means we don’t have exact numbers for any terms or topics. In order to give a value to terms, we index data from 1-100, where 100 is the maximum search interest for the time and location selected." "Normalization: When we look at search interest in a topic or query, we are not looking at the total number of searches. Instead, we look at the percentage of searches for that topic, as a proportion of all searches at that time and location."
Building on yesterday's exploration of data from Google Trends, let's create a final chart contrasting the official data of US visitors with searches on Google. For this download the google_trends.sqlite file below.