Description

Data scientist / engineer for 7 years (Master's in software engineering). I have delivered many data projects, covering databases and ETL pipelines as well as analysis and data science. I am also bilingual (English).

I worked for a year and a half at Qobuz as a data scientist, where I built several predictive models from A to Z to support customer retention, conversion, and fraud detection, and to feed a complete data warehouse (+5 to 10% additional retention after my models were deployed).

I then worked at Docker, Pernod-Ricard, and the DLG group as a Senior Data Engineer, mainly on DBT, Snowflake, BigQuery, and PostgreSQL.

I also have experience in web development (APIs, architecture) and several years of development in C and C++.

I have also completed several data projects in my spare time (mainly in Julia and Python).

Languages

French
Native or bilingual
English
Fluent
Chinese
Basic

Workplace preferences

Can work on-site

Paris (up to 10km), Lyon (up to 20km)

Digital Luxury Group
Senior Data Engineer
October 2024 - Today (1 year and 9 months)
Shanghai, China
Platform Development:

- Built end-to-end data platform from scratch using DBT (6-month initial development, ongoing feature expansion)
- Independently developed all data/DBT architecture and pipelines
- Consolidated 24 second-hand watch marketplaces into a unified analytics system
- Engineered ETL pipelines standardizing diverse business models (auctions, subscriptions, individual listings) into a single data model
- Implemented fuzzy matching algorithms with multi-stage logic to handle partial/incorrect reference numbers using watch features

Data Processing & Quality:

- Automated weekly incremental data ingestion with 50+ validation tests and data freshness alerts
- Built comprehensive table documentation covering all logic and data lineage
- Created custom tool for selective table refresh when needed
- Built global MSRP tracking with multi-currency conversion based on client geography/demand
- Processed 4M+ product images weekly using RabbitMQ across all projects

Client Delivery:

- Delivered Tableau dashboards with custom index calculations for well-known luxury goods companies
- Created configurable email automation using the Airtable and Tableau APIs with product-specific templates, currently running 10 emails on weekly, monthly, or quarterly schedules across 4 projects
- Provided on-demand custom exports (typical 2-3 week implementation cycles)
- Platform success resulted in 2 major client acquisitions with 2 additional prospects in pipeline

Infrastructure & DevOps:

- Migrated legacy Airflow and PostgreSQL from 2017 versions to current releases
- Implemented CI/CD pipelines using GitLab runners, Docker registries, and GitLab Pages
- Automated S3 backups and data lifecycle management with purging for oversized tables
- Integrated GitLab-Airtable automation workflows
- Introduced automation tools (n8n) and containerization standards
- All systems orchestrated through Apache Airflow with comprehensive monitoring and alerting
Airflow ETL DBT Tableau Python
Docker
Data Engineer
TECH
May 2021 - July 2023 (2 years and 2 months)
I was the first new data engineer across the whole data environment to arrive after the Docker split (Docker sold Swarm and was rebranded with 60 employees). I helped recruit 7 data engineers and 1 data analyst, migrated legacy code, led POCs to modernize the stack, was in charge of Looker (data visualization tool), and spent the last 6 months working largely on Snowflake administration.
Data visualization

I was in charge of Looker during the first year: I gave a quick introduction to every new employee who needed it and reviewed and merged all the PRs.

ETL

Full migration and improvement of our legacy code (Spark + Python / Jenkins).
Built the core DBT pipelines that later became the main pipelines.

Interviews

I conducted around 20 interviews, mostly for data engineers (7 of whom were hired), as well as for data analysts and data scientists.

POCs I led

- MonteCarlo
- DBT (approved)
- BigEye
- Dagster (approved)
- Alembic (approved)
- Atlan (approved)

Snowflake administration

I spent the last 6 months administering Snowflake with one colleague, which included:
- Full rework of the role system with versioning
- An hourly script that detects and alerts on data freshness issues, Snowpipe errors, etc.
- A credit consumption script that alerts on spikes and gives a daily report of credit usage per warehouse, etc.
- A script that detects heavy, costly queries (mostly from Looker) that can be improved
ETL DBT Docker Data Engineer Data visualisation
Pernod Ricard EMEA LATAM
Data engineer
WINE AND SPIRITS
June 2022 - September 2022 (3 months)
Paris, France
Team reinforcement to help release a large project in PySpark / Databricks
Databricks Python Snowflake