Big Data in Rio Olympics 2016

From predicting winners to training athletes to keeping the Zika virus at bay, Big Data is an integral part of the upcoming Rio Olympics

Chew on this. The total IT budget for the Rio De Janeiro Oympics 2016 is $1.5 billion and all of the games data is stored on the cloud. Back in 2012, the London Olympics became the first Big Data driven sporting event where 60 GB of data was transferred and 30,000 tweets were sent every second. 15 terabytes of data was generated every day by sports enthusiasts around the world. This year, these numbers are set to become a joke.

The Rio Olympics 2016 is fundamentally using the power of Big Data in three distinct ways:

Predicting outcomes and winners
Better training athletes
A disease-free games season

1. Predicting outcomes and winners

Even before the games begin in Rio, Big Data is being used to predict winners. For instance, Gracenote, a global provider of entertainment data, is leveraging its Big Data and predictive analytics expertise to provide detailed predictions that reveal which countries will top the medal tally, specific athletes who will emerge winners at the Olympics 2016, as well as an overall medals tally. According to Gracenote, it has used performance data from thousands of events in addition to data from an extensive Olympics database dating back to 100 years, to create what it calls a ‘Virtual Medal Table’ (VMT), which is essentially a dashboard which presents these results in an intuitive manner. Broadcasters, sports websites and mobile providers will use data from this dashboard and personalize results for audiences from their respective countries. Here’s a sample of Gracenote’s Rio VMT for 2016:

2. Better training athletes

It is not just third-party providers who are using Big Data at the Rio Olympics 2016. The coaches of various countries are enhancing their training programs by using analytics data of their players as well as data available in the public domain. For team sports, analysis of historical player performance data, performance trends against specific competition, and fitness statistics are sourced through data analytics. For non-team sports, predictive analytics is used to identify who a particular athlete is pitted against, his/her strengths and weaknesses, insight into past strategies, and most importantly, to enhance fitness based on trends. Simply put, Big Data Analytics connects an athlete to his/her strengths and provides a mirror to his/her sporting abilities.

3. A disease-free games season

No entry for Zika at Rio Olympics 2016! Thousands of Brazilian officers and dozens of non-profit organizations are involved with the Rio Olympics 2016, to ensure a disease-free games season. With looming threats of the Zika virus, as well as Dengue and Chikungunya, officials are leveraging social data to combat viruses. IBM is helping the organizing committee by leveraging Big Data Analytics to crowdsource data from social media. The company is organizing and arranging huge amounts of online data, including the frequency and distribution of social media comments and conversations about Zika and other viruses.

IBM is also using its cloud computing backbone to analyze Portuguese-language Twitter postings about the virus. Additionally, it is also analyzing GPS-enabled data around the prevalence of the Aedes aegypti mosquito, which spreads the Zika virus. All of this data is synced up with other auxiliary data like weather, location of airports around Brazil where potential Zika infected people are quarantined. Other high-risk areas are monitored in real-time to identify outbreaks, earmark high-risk areas and prevent probable infection to athletes as well as the global audience that descends on Rio.

Computation process behind Big Data analytics at Rio Olympics 2016

In order to better understand the process that can be followed for predictive analysis of Big Data in scenarios like these, let’s consider the nature of data that needs to be processed in order to create performance leader boards. Let’s understand this step-by-step:

Step 1: The first step is to identify the key metrics on which the result would depend. The available data for 100 odd years can be segregated into Country, Sport, Result, Name of the athlete, age of the athlete, previous performance records, among others.

Step 2: Once the metrics is identified, the data can be divided into two sub data-sets, say a Training data-set and a Test data-set.

Step 3: Data from the Training data-set can then be used to create Models. The Test data-set, on the other hand, contains all of the historical data including definite results.

Step 4: Data from the Test data-set is fed into the Model and mapped for results. This way, we can get data that can predict results based on historical data.

However, for crowdsourcing social data for preventing disease outbreaks, a relatively simpler process can be used:

Data around social media interactions from different social channels can be streamed live, probably using Spark Streaming, into a Hadoop Distributed File System (HDFS). Once all the different data is transformed into a standardized format, this data can be analyzed for co-ordinates such as location, time of the data etc. Based on an exhaustive set of keywords, relevant data can be retrieved and real-time alerts can be created for efficient disease control.

These are just some of the ways in which the Big Data is playing its part in the biggest sporting event of the year. Do you have any ideas of how Big Data can be used in sporting events or for better public health? Feel free to write your comments below.

Want to learn Big Data from industry experts? Edureka has created a top-class course on Big Data & Hadoop that helps you learn more about Hadoop clusters, HDFS, HBase and much more. Click here if you wish to know more.

Rio Olympics 2016: Big Data powers the biggest sporting spectacle of the year!

1. Predicting outcomes and winners

2. Better training athletes

3. A disease-free games season

Computation process behind Big Data analytics at Rio Olympics 2016

Recommended videos for you

Webinar: Introduction to Big Data & Hadoop

Spark SQL | Apache Spark

Introduction to Big Data TDD and Pig Unit

Hadoop Tutorial – A Complete Tutorial For Hadoop

What Is Hadoop – All You Need To Know About Hadoop

Is It The Right Time For Me To Learn Hadoop ? Find out.

Power of Python With BigData

MapReduce Design Patterns – Application of Join Pattern

Distributed Cache With MapReduce

Big Data Processing with Spark and Scala

Pig Tutorial – Know Everything About Apache Pig Script

Logistic Regression In Data Science

Advanced Security In Hadoop Cluster

Hadoop-A Highly Available And Secure Enterprise Data Warehousing Solution

Introduction to Hadoop Administration

Hadoop for Java Professionals

Hadoop Architecture – Hadoop Tutorial on HDFS Architecture

Python for Big Data Analytics

Apache Spark For Faster Batch Processing

Big Data Processing With Apache Spark

Recommended blogs for you

What are the Key Terminologies in Hadoop Security?

Introduction of Hadoop Architecture

Azure Databricks Architecture Overview

4 Practical Reasons to Learn Hadoop 2.0

Everything About Cloudera Certified Administrator for Apache Hadoop (CCAH)

Hive Data Models: Designing Efficient Data Structures

Mastered Hadoop? Time to get started with Apache Spark

Splunk Tutorial For Beginners: Explore Machine Data With Splunk

HDFS Commands: Hadoop Shell Commands to Manage HDFS

Introduction to Hadoop

Jupyter Notebook Cheat Sheet : A Beginner’s Guide to Jupyter Notebook

Apache Pig Installation on Linux

Azure Synapse vs. Databricks – What Are the Differences?

Microsoft Fabric vs. Databricks

Why You Should Choose Python For Big Data

Hadoop Job Opportunities 101: Your Guide To Bagging Top Hadoop Jobs In 2020

What is Hadoop? Introduction to Big Data & Hadoop

Applying Hadoop with Data Science

Apache Spark Ecosystem

How To Create User In MongoDB?

Join the discussionCancel reply

Browse Categories

Subscribe to our Newsletter, and get personalized recommendations.

Rio Olympics 2016: Big Data powers the biggest sporting spectacle of the year!