TidyTuesday

TidyTuesday, also written as Tidy Tuesday, tidytuesday, or #tidytuesday, is a weekly community of practice currently organized by the Data Science Learning Community (DSLC). A new dataset is highlighted each week so that participants can practice exploring and visualizing data and sharing their findings. Participants can follow the daily hashtag #tidytuesday on social media.


History

TidyTuesday was started by Tom Mock, a product manager at Posit PBC, on April 1, 2018. It was created to help both newcomers to data and more experienced data scientists feel less socially isolated, and to give them a way to practice skills such as acquiring, cleaning, wrangling, visualizing, and presenting data. Some participants have reported feeling inspired by others' data visualizations and have noted that most people share their code so that others can replicate their work.


Impact

TidyTuesday has also been used by other groups and has featured published datasets. R-Ladies Global has used TidyTuesday datasets in a hackathon to practice data skills. In February 2021, Allen Hillery, Anthony Starks, and Sekou Tyler started the #DuboisChallenge, in which participants used modern data visualization tools to recreate the data visualizations of sociologist and activist W. E. B. Du Bois. TidyTuesday highlighted these datasets for the data community in 2021 and 2022. In 2021, TidyTuesday also featured the zipcodeR dataset, which contains 41,000 ZIP codes for analysis.

Educators training data scientists have struggled to coordinate their preparation, but some have suggested that learners build a portfolio to highlight both technical skills and data thinking skills. TidyTuesday is one suggested source of datasets for a formal, visual project. Such a project can help teach novice data practitioners to program better in languages such as R.


See also

* Tidyverse


External links

* Official website: https://www.tidytuesday.com/
* GitHub page
* Python TidyTuesday - GitHub
* Data Science Learning Community (DSLC)