About Hussein
French
Native or bilingual
Arabic
Native or bilingual
English
Fluent
Experience
- Apache AirflowCommitter & PMC memberTECHApril 2023 - Today (3 years and 2 months)Paris, France- Active contributor; fix the reported bugs, introduce new features and improve the code quality and its performance.- Join the discussions and participate in deciding the future of the project.- Test and vote on the different releases, mentor the new contributors and help Airflow users to solve their problems.
- LeboncoinSenior Data EngineerE-COMMERCEOctober 2021 - Today (4 years and 7 months)Paris, France- Developing low-latency stream applications using Java, Spring Cloud Stream, KStream, Kafka and K8S for fraud detection.- Develop a scalable in-house feature store to ingest the aggregate and ingest the company events (>1B/day) in a KV store (DynamoDB) using Kafka, Kstream, FastAPI, AsyncIO, Avro and Airflow.- Designing a new Lakehouse architecture to apply GDPR on the legacy datalake and optimize the data processing by optimizing the data files (compaction, z-order, indexing, ...) using Java Spark, HUDI, Airflow, Kafka, S3, Avro, Glue and K8S.- Improve the data platform: migrate Airflow from LocalExecutor to Celery, migrate spark jobs from EMR (YARN) to K8S, migrate Airflow operators to the new deferrable (async) mode to reduce the infra cost, migrate Spark to jdk11 after patching the Hive which doesn’t work with jdk11.- As a Sr. DE, I give data/infra courses (Airflow, Spark, Terraform, K8S, ...), I help data teams to design their projects and overcome challenges, and I lead the contribution to open source data projects.
- Data4RiskData Engineer & Head of DataTECHFebruary 2019 - October 2021 (2 years and 8 months)Paris, France- Designing and implementing stream applications and batch ETL using Docker, Pyspark, Kafka, Argo Workflows, mongoDB, MySQL, MinIO and Kubernetes on GCP and OVH cloud to collect and process weather data.- Designing and implementing a low-latency Lakehouse using PySpark streaming, Delta Lake, Hive and K8S, to support ACID transactions, update and Delete operations and time travel on the big data tables.- Leading the DS team: desing and train ML models and pipelines using Keras, Tensorlow, MLlib and other libraries to classify and process satellite images and deploy these models using MLflow and TF serving.- Designing and implementing a datalake for a financial data platform: stream Spark on k8s jobs to ingest Kafka events in the parquet datalake, and batch Spark on k8s ETL pipelines scheduled by Argo
Recommendations
Be the first to recommend Hussein
Help this freelancer shine by sharing your experience working together.
These freelancer profiles also match your criteria
Agatha Frydrych
Backend Java Software Engineer
4.7
(3)
2
Baptiste Duhen
Fullstack developer
4.6
(4)
5
Amed Hamou
Senior Lead Developer
4
(2)
7
Audrey Champion
Web developer
4.3
(3)
4
Education
- Master en Data ScienceGrenoble INP ENSIMAG2019
- Licence en InformatiqueUniversité libanaise - Faculté des sciences2017