US Open Championship from 1881-2021
CONCEPT
This is also one of my undergraduate data visualization assignments. This task focuses on developing an understanding of high-dimensional data types and formats, as well as the interaction and presentation tools needed to use them in different visualization technologies. We need to use different data visualization techniques to analyze the performance of champion players, such as nationality, gender and score. Finally, summarize and present the trends and breakthrough analysis found in the visualization process.
DATA
This dataset is from the US Open Championship, which includes 141 years of men's and women's championships between 1881 and 2021. The dataset has a variety of data types and forms, including year, gender, champion‘s name, nationality, champion seed, match time, runner-up's name, nationality, and score. We have to use the required methods for analysis and visualization, which requires conversion into different formats for comparison.
There are three blank columns for Champion Seed, Minute, and Runner-Up Seed. These three values were not properly recorded before 1945, so we removed them from the dataset. Also, since Tableau does not recognize nationality codes, two columns for the winner and runner-up countries are created and populated. In scoreboard fields, the data is in an incomparable text format. We convert this into 10 new columns representing wins and losses for each set.
Parallel Coordinate by Wins, Win Rate and Losses(Normalized)
Treemap by Gender, CNT-Champions and Avg-WinRate
Geographic Map for the distribution of champions in Europe
PROCESS
In this project, the data visualization process is as follows:
-
Data understanding & Data preparation
-
Use different visualization techniques to visualize and analyze player's nationality, gender, as well as time series and country grouping performance.
1)Parallel Coordinate
2)Treemap
3)Geographic Map
4)Top Player Performance
-
Summary and Conclusion
Tools: Excel, Tableau
Time Series Analysis: All the champions in the US Open Championship