You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Hussein AwalaHA

Hussein Awala

Senior Data Engineer

€1,000/day
Châtillon, FR
3-7 years

Average response time: 1 hour

About Hussein

I'm a Senior Data Engineer in the Ad Network team at Voodoo and a Committer & PMC member at Apache Airflow.

I have worked on various types of projects, including:
- Building GDPR-compliant Lakehouses and analytics platforms using Apache Iceberg or Apache Hudi.
- Developing low-latency stream applications (stateful and stateless).
- Creating ML platforms on top of Kubernetes clusters, and serving ML models with FastAPI and GRPC.
- Setting up and deploying Spark On Kubernetes clusters with hundreds of jobs and thousands of daily job runs.
  • French

    Native or bilingual

  • Arabic

    Native or bilingual

  • English

    Fluent

Can work on-site
Châtillon (up to 50km)

Experience

  • Apache Airflow
    Committer & PMC member
    TECH
    April 2023 - Today (3 years and 2 months)
    Paris, France
    - Active contributor; fix the reported bugs, introduce new features and improve the code quality and its performance.
    - Join the discussions and participate in deciding the future of the project.
    - Test and vote on the different releases, mentor the new contributors and help Airflow users to solve their problems.
    Airflow Python Kubernetes AWS Helm flask Github Actions GCP Vault SQL
  • Leboncoin
    Senior Data Engineer
    E-COMMERCE
    October 2021 - Today (4 years and 7 months)
    Paris, France
    - Developing low-latency stream applications using Java, Spring Cloud Stream, KStream, Kafka and K8S for fraud detection.
    - Develop a scalable in-house feature store to ingest the aggregate and ingest the company events (>1B/day) in a KV store (DynamoDB) using Kafka, Kstream, FastAPI, AsyncIO, Avro and Airflow.
    - Designing a new Lakehouse architecture to apply GDPR on the legacy datalake and optimize the data processing by optimizing the data files (compaction, z-order, indexing, ...) using Java Spark, HUDI, Airflow, Kafka, S3, Avro, Glue and K8S.
    - Improve the data platform: migrate Airflow from LocalExecutor to Celery, migrate spark jobs from EMR (YARN) to K8S, migrate Airflow operators to the new deferrable (async) mode to reduce the infra cost, migrate Spark to jdk11 after patching the Hive which doesn’t work with jdk11.
    - As a Sr. DE, I give data/infra courses (Airflow, Spark, Terraform, K8S, ...), I help data teams to design their projects and overcome challenges, and I lead the contribution to open source data projects.
    Airflow Spark Python Java Kubernetes AWS Kafka Hudi parquet MLflow Github Actions Terraform avro FastAPI
  • Data4Risk
    Data Engineer & Head of Data
    TECH
    February 2019 - October 2021 (2 years and 8 months)
    Paris, France
    - Designing and implementing stream applications and batch ETL using Docker, Pyspark, Kafka, Argo Workflows, mongoDB, MySQL, MinIO and Kubernetes on GCP and OVH cloud to collect and process weather data.
    - Designing and implementing a low-latency Lakehouse using PySpark streaming, Delta Lake, Hive and K8S, to support ACID transactions, update and Delete operations and time travel on the big data tables.
    - Leading the DS team: desing and train ML models and pipelines using Keras, Tensorlow, MLlib and other libraries to classify and process satellite images and deploy these models using MLflow and TF serving.
    - Designing and implementing a datalake for a financial data platform: stream Spark on k8s jobs to ingest Kafka events in the parquet datalake, and batch Spark on k8s ETL pipelines scheduled by Argo
    Python Spark kafak Delta Lake Kubernetes Argo Workflows MongoDB GCP OVH MLflow Gitlab CI Argo CD Terraform

Recommendations

Be the first to recommend Hussein

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Master en Data Science
    Grenoble INP ENSIMAG
    2019
  • Licence en Informatique
    Université libanaise - Faculté des sciences
    2017

Skill set

Categories