Analyses

Made for fun and training, here is a selection of data analyses and visualizations. Most of them are coded in Python and R.


Names

Age predictor for French first names

Tell my your name, I will know your age. Using data on the 1000 most popular French first names, this dashboard guesses the most likely age interval given a name. It also provides the most closely correlated and most popular names by generation.
🇫🇷 See dashboard


location_history

Google location history

Based on my Google location history daily data, I mapped four years of my life to see how I move around home, in a graphical way. R
See notebook


time_series_prediction

Time series prediction

Comparison of models to predict time series, namely daily temperatures in New York City. This notebook goes from naive forecasts up to LSTM neural networks, and compares their performances. Python
See notebook


stats_ab_testing

Hypothesis testing

Cheatsheet for frequentist A/B testing. Practical summary of the statistical notions and formulas, and their implementation in Python. Python
See notebook


ga360_analysis

Google Analytics 360

How to query Google Analytics 360 raw logs on BigQuery, and start some basic aggregations and insights, to get the ball rolling for deeper analyses. Python
See notebook


france_kings

Kings of France

Since the fifth century, France was ruled by 87 different kings. But how did their reign duration increase, from the Dark Ages up to the French revolution ? R
See notebook


french_baby_names

French baby names

If you were born one century ago in France, you had 15% chances to be named Jean or Marie. Today first names are much more diversified, but show strong generational trends. Python
See notebook


aircraft_crashes

Aircraft crashes

Afraid of flying ? Data may help you rationalize. Analyzing the causes of 5000+ airplanes crashes since 1908 gives insights about what can go wrong, and how flight security improved over time. R
See notebook


wifi_paris

Public Wifi in Paris

How touristic is the city ? Analysing a dataset containing of connections made on Wifi hotspots in Paris, we find insights about tourists most frequented places and countries of origin. R
See notebook


Fitcoach

Stanford class project: R package for Fitbit

Developed as the final project for Stanford University class Computing for Data Science in 2016, this R package provides analysis and predictions for the physical activity of people wearing Fitbit trackers. R
See code