About Fabien
French: Native or bilingual
English: Fluent
Chinese: Basic
Experience
- Digital Luxury Group
Senior Data Engineer
October 2024 - Today (1 year and 8 months)
Shanghai, China
Platform Development:
- Built end-to-end data platform from scratch using DBT (6-month initial development, ongoing feature expansion)
- Independently developed all data/DBT architecture and pipelines
- Consolidated 24 second-hand watch marketplaces into unified analytics system
- Engineered ETL pipelines standardizing diverse business models (auctions, subscriptions, individual listings) into single data model
- Implemented fuzzy matching algorithms with multi-stage logic to handle partial/incorrect reference numbers using watch features
Data Processing & Quality:
- Automated weekly incremental data ingestion with 50+ validation tests and data freshness alerts
- Built comprehensive table documentation covering all logic and data lineage
- Created custom tool for selective table refresh when needed
- Built global MSRP tracking with multi-currency conversion based on client geography/demand
- Processed 4M+ product images weekly using RabbitMQ across all projects
Client Delivery:
- Delivered Tableau dashboards with custom index calculations for well-known luxury goods companies
- Created configurable email automation using Airtable and Tableau APIs with product-specific templates, currently running 10 emails weekly/monthly/quarterly across 4 projects
- Provided on-demand custom exports (typical 2-3 week implementation cycles)
- Platform success resulted in 2 major client acquisitions with 2 additional prospects in pipeline
Infrastructure & DevOps:
- Migrated legacy Airflow and PostgreSQL from 2017 versions to current releases
- Implemented CI/CD pipelines using GitLab runners, Docker registries, and GitLab Pages
- Automated S3 backups and data lifecycle management with purging for oversized tables
- Integrated GitLab-Airtable automation workflows
- Introduced automation tools (n8n) and containerization standards
- All systems orchestrated through Apache Airflow with comprehensive monitoring and alerting
- Docker
Data Engineer
TECH
May 2021 - July 2023 (2 years and 2 months)
I was the first new data engineer in the whole data environment to arrive after the Docker split (Docker sold Swarm and was rebranded with 60 employees). I helped recruit 7 data engineers and 1 data analyst. I migrated legacy code, led POCs to modernize the stack, was in charge of Looker (data visualization tool), and spent the last 6 months working extensively on Snowflake administration.
Data visualization
I was in charge of Looker for the first year: I gave a quick introduction to every new employee who needed it and reviewed/merged ALL the PRs.
ETL
Full migration and improvement of our legacy code (Spark + Python / Jenkins).
Built the core DBT pipelines that later became the main pipelines.
Interviews
I conducted around 20 interviews, mostly for data engineers (including 7 who were subsequently hired), as well as for data analysts and data scientists.
POCs I led
- MonteCarlo
- DBT (approved)
- BigEye
- Dagster (approved)
- Alembic (approved)
- Atlan (approved)
Snowflake administration
I spent the last 6 months administering Snowflake with one colleague, which included:
- Full rework of the role system with versioning
- Script that runs hourly to detect and alert on data freshness issues, Snowpipe errors, etc.
- Script for credit consumption that alerts on spikes, gives a daily report of credit usage per warehouse ...
- Script that detects heavy queries (mostly from Looker) that are costly and can be improved
- Pernod Ricard EMEA LATAM
Data Engineer
WINE AND SPIRITS
June 2022 - September 2022 (3 months)
Paris, France
Team reinforcement to help release a big project in PySpark / Databricks
Education
- Master's degree in Computer Science, EPITECH - European Institute of Technology, 2019
- Master's degree in Computer Science, 北京交通大学 (Beijing Jiaotong University), 2018