Maher Naija

AI Engineer | LLM Fine-tuning · RAG · Agents | MLOps

€850/day
Châtillon, FR
15+ years

Average response time: 1 hour

About Maher

AI Engineer with 14+ years of experience, specializing in large-scale LLM fine-tuning and production RAG agents serving 1,600+ enterprise users.

LLM fine-tuning: distributed training on 16 H200 GPUs with FSDP, LoRA, and HF Trainer, cutting costs 3x vs. full fine-tuning. High-performance inference with vLLM (100 tokens/s, 99% uptime).

RAG & AI agents: multi-agent pipelines with LangGraph and LangChain, OpenAI/Anthropic APIs, and Qdrant semantic search, deployed in enterprise production for 1,600+ users.

End-to-end MLOps: Kubeflow, MLflow, Kubernetes (CKA certified), CI/CD on AWS. Time-to-production reduced by 50%.

At Dassault Systèmes Outscale: a complete AI/ML platform, from distributed training (FSDP, PyTorch) to production inference. Expertise in Big Data (Bouygues Telecom, Kafka/Spark) and networking (Qosmos, C). Founder of thejobbooster.cloud.
  • English

    Native or bilingual

  • French

    Native or bilingual

Remote only
Primarily works remotely

Experience

  • Dassault Systèmes Outscale
    AI Architect
    SOFTWARE PUBLISHING
    December 2020 - Today (5 years and 6 months)
    Paris, France
    1. Machine Learning / AI Platform – Agentic AI
    • Architected LLM fine-tuning on 16 H200 GPUs using LoRA and FSDP (Fully Sharded Data Parallel), cutting training cost 3x vs. full fine-tuning
    • Architected production-grade LLM inference with vLLM on GPU clusters, serving 1,600 users at 99% uptime and 100 tokens/s throughput, with a 3x performance improvement
    • Mentored 7 ML/AI engineers in deploying models to production; established MLOps practices for model lifecycle management, monitoring, and reproducibility, reducing time-to-production by 50%
    • Led AI platform architecture, integrating open-source LLMs on a GDPR-compliant sovereign cloud
    • Designed a multi-agent RAG pipeline with LangGraph and LangChain, processing 10,000+ enterprise documents
    Python, vLLM, LangGraph, LangChain, Kubernetes, MLflow, RAG, FSDP, LoRA, PyTorch, LangFuse, Kubeflow, Qdrant, OpenTelemetry, Prometheus, MCP
    2. Accounting / Billing Platform: 7 platforms, multi-region/AZ
    • Owned architecture and roadmap for 7 multi-region billing platforms (SLA, SLO, PRA), achieving 99.99% availability
    • Implemented automated platform provisioning with Terraform and Ansible, reducing environment setup time from days to under 2 hours
    • Built event-driven pipelines for real-time billing data processing, eliminating billing delays and reducing revenue reconciliation errors
    • Cut billing software upgrade cycles from 1 week to 2 days with CI/CD
    Terraform, AWS, Kubernetes, EKS, GitLab CI, Docker
    vLLM, LLM, LangGraph, RAG, FSDP
  • Bouygues Telecom
    Senior Data & Platform Engineer
    TELECOMMUNICATIONS
    January 2015 - December 2020 (5 years and 11 months)
    Paris, France
    1. Big Data / Data Lake Platform – National Fixed-Network Monitoring
    • Designed and implemented the data architecture covering 9 device types and 3 million network access devices; delivered a national supervision map enabling anomaly detection at scale
    • Built ETL workflows processing 180 GB/day of telemetry data from 3 million network devices, feeding the national supervision map in real time
    • Drove cross-functional deployment, delivering integrations 2 weeks ahead of schedule for a national-scale rollout to 3 million devices
    Kafka
    MLflow, Apache Kafka, Kubernetes, Apache Spark, Airflow
  • Qosmos
    Software Engineer
    TECH
    September 2011 - December 2014 (3 years and 3 months)
    Paris, France
    1. as
    • Achieved 10 Gbit/s throughput by developing and virtualizing the Deep Packet Inspection (DPI) engine for horizontal scalability
    • Implemented and optimized RFC-compliant network protocol parsers, contributing to sub-millisecond per-packet processing latency
    Python, Linux, Machine learning, PyTorch

Education

  • Certified Kubernetes Administrator (CKA)
    Linux Foundation
    2024
  • Master's Degree in Innovation Management
    ENSAM
    2013

Certifications

  • Certified Kubernetes Administrator (CKA)
    Linux Foundation
    2024

Skill set

Categories