Again, I'm Evan Jones, One of the course designers for data to insights. I've been teaching data analysis for over 10 years. My life at Google before developing courses like this one was in Google Finance. We built pretty fun machine learning models to predict and optimize expenses here at Google. I'm thrilled that Google has made their internal petabyte scale data analysis tools available to the world through the Google Cloud Platform. It's that platform that we're going to be using to explore and derive insights using their Big Data Tools. Let's take a quick look at the agenda of topics we're going to cover. First, we'll start with the basics of Google Cloud Platform and wildland in the Cloud handle your computer and storage needs enables massive scalability. After the fundamentals of Cloud, we'll go into the Big Data Tools available to you as a data analyst. We're going to focus on BigQuery, Google Data Studio, and Cloud Dataprep to start. Third is where we'll start coding in SQL, the Structured Query Language. Forth, we'll explore the BigQuery pricing model for query processing and data storage. Next up is a discussion on dirty data and how we can clean it up with SQL or a new GUI tool. Sixth and seventh on this list is how you can create and store your own datasets on BigQuery from your queries or from external data sources. We'll close here with an introduction of visualization and how to create reports from your data within Data Studio. Moving on to some of the more advanced topics we're going to cover, you're going to look at joins and unioning your datasets together and BigQuery as some of the more advanced statistical functions and user-defined functions you may not have seen before. Afterwards is one of my favorite sections on how repeated fields in arrays work within BigQuery nested data structures. Again, here we'll close with some more advanced data visualization tips within Data Studio. In these last sections, we'll walk through one of the most popular topics, which is troubleshooting query and dataset performance. Lastly, before wrapping up, we'll close this specialization with a critical topic of data security and access controls. This class is targeted primarily at data analysts who query their business datasets using SQL and create insightful reports and dashboards. First and foremost, we're take a look at those challenges that are faced by data analysts. Let's just jump right into those. If you've run any queries in your life, particularly like when I was learning database processing in school, my instructors and teachers would say, "Hey, run this one query and then you can go to the bathroom or do whatever you need to do while your query is running. Upper left you see that queries that are taking too long, that could potentially stall your analysis. Or what about if I wanted to combine 15 data sources in query all of them, and I wanted to do that within a reasonable amount of time. A lot of times that was hard to do. In the middle, I'll say it wasn't a querying problem, but it was actually an infrastructural problem. I'm a data analyst or a data scientist, I am not a hardware purchasing department. I don't know about buying servers and storing multiple versions of hard drives that are redundant in case a hard drive ploter fails. I have to maintain the network of all of my data as it relates to processing my queries and accessing the data word that's stored. I don't want to deal with any of that kind of infrastructure. But I have to as a necessary evil if want to be a big data shop. Or if you're using say like Hadoop on your clusters, you're managing your clusters but you've had this amazing capital outlay to get this awesome processing cluster. But now you're punished by your own success because now your clusters can't scale because your organization says, you did such an amazing job, now we have 10 times the data. Can your clusters handle it or do you need to buy more and keep expanding out your ever-growing Infrastructure empire? Again, how much of the business of building infrastructure do you want to be in, versus spending that opportunity cost of infrastructure, versus writing out those amazing queries to those machine learning models to get those insights. Lastly, is pretty parent one, which is just a good cost. Maybe you have a tone of data, but you literally just can't afford to process all of it just because performance wise, it's prohibitive on your machines, and you can only create a few columns. Or it's just the monetary cost, which processing that much data and storing that much data is just prohibitive. Last but not least, if you have no central place where you can just dump all this data into like a staging area or analytics warehouse, that could be a problem as well. When you go into, these are a lot of the same exact problems that Google had growing up, and faced with a torrent of search, indexing data and adds volume data and the necessary problems that Google as a big data organization had to solve. We'll see exactly how they did that and the benefits of technology and time that have evolved to create a lot of these cool Google Cloud Platform tools, these Big Data Tools like BigQuery.