Théo Simier

Data Analyst / Scientist Python - SQL - Data Viz

Moves to Paris, Rennes, Nantes, Lille

Location and geographical scope

Paris, France
Can work in your office at
  • Paris and 50km around
  • Paris and 100km around
  • Rennes and 100km around
  • Nantes and 100km around
  • Lille and 100km around



Stack Overflow

Stack Overflow: Tousalouest
  • 145 Reputation
  • 10 Bronze
  • 2 Silver
  • 0 Gold


  • French

    Native or bilingual

  • English

    Full professional proficiency

  • Spanish

    Limited working proficiency

  • Chinese



Théo in a few words

With a dual background in Data Science and Business, I can help you analyze your data.

I regularly work with:

Languages: Python, R, SQL, Scala
Domains: Machine Learning, Data Visualization, Data Mining, Data Analytics, NLP
Tools: Heap, Tableau, Jupyter Notebook, GitHub, PostgreSQL, BigQuery
Libraries: Pandas, Scikit-Learn, Matplotlib, Seaborn, NLTK, Gensim

Feel free to contact me.


Malt Community - Malt

High Tech

Data Analyst - Data Mining

Paris, France

June 2019 - Today

Data Science/Data Engineering missions:
• Creation of a machine learning algorithm to predict the win rate and revenue of leads
• Data enrichment topics (scraping of open source datasets)
• Creation of an NLP algorithm to make recommendations based on text inputs
• Creation of custom tables in BigQuery

Data Analyst missions:
• Data Mining exploratory analysis to better understand our users (Data Viz, Clustering, etc)
• UX/UI analysis to give insights to product managers and designers
• Monitoring KPIs during the launch of new features
• Tutoring of interns

Stack: Python (XGBoost, Scikit-Optimize, mlflow, Scikit-Learn, Pandas, Matplotlib, Seaborn), Scala, SQL, Heap, Chartio

Kaggle competition/Thesis

High Tech

Data Scientist

March 2019 - June 2019

Moderation of online content through NLP and Deep Learning
Classification accuracy of 87%.

I explain the most commonly used techniques for extracting meaningful information from text and apply them to a concrete example:
an algorithm that moderates questions by checking whether each question
complies with the terms and conditions of Quora, a question-and-answer website.
I started by creating statistical features, normalizing the questions and transforming them into a format that classification algorithms can handle.
Finally, I used a Logistic Regression, a Random Forest and a Deep Learning model to predict whether each question was compliant.
The best algorithm reached an accuracy of 87%.
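
As an illustration only, the preprocessing step described above (statistical features, normalization, conversion to a numeric format) could be sketched as follows. This is a minimal standard-library sketch, not the thesis code: the function names, the chosen features and the two toy questions are assumptions made for the example.

```python
import re
from collections import Counter


def normalize(question: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9']+", question.lower())


def statistical_features(question: str) -> dict[str, float]:
    """Simple hand-crafted features computed on the raw, un-normalized text."""
    words = question.split()
    return {
        "n_chars": float(len(question)),
        "n_words": float(len(words)),
        "n_question_marks": float(question.count("?")),
        "upper_ratio": sum(c.isupper() for c in question) / max(len(question), 1),
    }


def build_vocabulary(questions: list[str]) -> dict[str, int]:
    """Map each distinct token to a column index (sorted for determinism)."""
    tokens = sorted({tok for q in questions for tok in normalize(q)})
    return {tok: i for i, tok in enumerate(tokens)}


def vectorize(question: str, vocabulary: dict[str, int]) -> list[int]:
    """Bag-of-words count vector; tokens outside the vocabulary are ignored."""
    counts = Counter(normalize(question))
    vector = [0] * len(vocabulary)
    for tok, n in counts.items():
        if tok in vocabulary:
            vector[vocabulary[tok]] = n
    return vector


# Hypothetical toy questions, not taken from the Quora dataset.
questions = ["How do I learn Python?", "WHY is everyone so dumb???"]
vocabulary = build_vocabulary(questions)

# One numeric row per question: word counts followed by the statistical features.
rows = [
    vectorize(q, vocabulary) + list(statistical_features(q).values())
    for q in questions
]
```

Each entry of `rows` is a fixed-length numeric vector, which is the kind of input that a classifier such as a logistic regression or a random forest expects.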

HP Inc

High Tech

Category Manager Junior

Paris, France

January 2017 - December 2017

• Forecasting of sales and revenue through Data Analytics
• Tracking of KPIs, forecasting of the quarterly sales
• Pricing and definition of the local product offering on the SMB market
• Training of one new employee and one intern
  • Data analysis

