You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Yassine GhouzamYG

Yassine Ghouzam

GCP Data/ML engineer - NLP

€650/day
Paris, FR
3-7 years

Average response time: 1 hour

About Yassine

Après avoir obtenu mon PhD en Computational Biology, j’ai travaillé deux années à Quantmetry, leader français du conseil en Data science en tant que consultant Data Scientist confirmé. Aujourd’hui, je propose mes services afin de répondre à vos problématiques en data science / machine learning.

Durant mes différentes expériences, j'ai pu participer et relever de nombreuses challenges data passionnants pour des clients de secteurs variés (Groupe de presse, banque, assurance, santé, public) a différents phases de projet (Cadrage, POC, Pilote, Mise en production)

Je peux vous accompagner sur l’ensemble des étapes qui composent un projet de machine learning :

Cadrage :
- Définition business du projet (définition du besoin, ROI attendu) /Identification du cas d’usage
- Évaluation de la conception (qualité et disponibilité des données)
- Conception de la stratégie machine learning
POC (Proof of concept) / Mise en Pilote :
- Sélection et identification du cas d’usage pour répondre à la problématique (Analyse supervisée/non supervisée, classification/régression, clustering, détection d'anomalie, analyse de graphe, système de recommandation, etc)
- Acquisition des données (données internes, données publiques externes, web-scraping, APIs)
- Construction de la cible à modéliser en collaboration avec les experts métiers
- Visualisation de données
- Feature Engineering (création de features pertinentes, traitement de features catégoriques, etc.)
- Choix du/des modèle(s) et optimisation des hyper-paramètres
- Choix de metrics pertinentes et adapté aux métiers pour valider la performance des modèles
- Interprétabilité des modèles

Mise en production :
-Contrôle automatique de la qualité des données
-Gestion automatique du cycle de vie du modèle (dérives des données et des performances)
-Logging et reporting par email
-Planification de lancement du modèle
  • English

    Fluent

  • French

    Native or bilingual

Can work on-site
Paris (up to 50km), Paris (up to 10km)

Experience

  • Quantmetry
    Data Scientist Confirmé
    CONSULTING AND AUDITS
    October 2017 - June 2019 (1 year and 7 months)
    Paris, France
    Industrial
    Monitoring and prediction of supplier quality, automobile industry – Data Scientist
    • Business scoping and use case modeling of the project
    • Data preparation and cleaning
    • Statistical analysis and KPI definitions
    • Incident prediction model using Random Forest and ensembling models
    • Dashboarding and Visualization on Power Bi
    • Python, PowerBI, SQL Server - Agile Method

    Banking & Insurance
    Cloud deployment of multiples Churn prediction models in production– Tech lead Data Scientist & Project manager
    • Automatic input data quality check
    • Automatic model life cycle management (input data and performance shift)
    • Automatic logging and email notification reporting
    • Automatic scheduling and performance monitoring
    • Modeling and pipeline optimizations
    Python, Pandas, Dash, AWS Cloud, GitHub –Agile method

    Marketing
    Newsletters subscription score modeling and industrialization, Press Group – Tech lead Data Scientist & Project manager
    • Open/Click Log scrapping
    • Data preparation and cleaning
    • Statistical analysis and KPI definitions
    • Attrition target conception and definition
    • Prediction using ensembling models
    • Model life cycle management (performance)
    • On-premise deployment
    Python, Bash –Agile method

    Healthcare
    Natural Language Processing, content tagging extraction of tags in patient’s documents– Tech lead Data Scientist
    Development of a NLP methodology to analyze Healthcare Providers. (Automatic Content Analysis, Recommendation Algorithms, Clustering, Dimensionality Reduction, Deep Learning) – Tech lead Data Scientist
    Python, Bash, Pandas, Keras , Tensorflow , SpaCy, Gensim, Dash, Scikit-learn, GitHub

    Drug side effect clustering and prediction project (Diagnostic phase) – Tech lead Data Scientist & project manager
    Use case identification
    Practicability assessment
    Data sample quality check
    Technical strategies conception

    Internal and R&D projects:
    - Responsible of healthcare Vision :
    o Identification of new AI healthcare use cases (patient, medical & drug path)
    o Commercial structuration and organization of Quantmetry healthcare vision

    - Member of computer vision expert task force :
    o MaskR-CNN template code packaging for internal usage
    o Scientific literature monitoring

    - Kaggle competition (Data sicence Bowl 2018): nucleus detection from image of biological cells image segmentation models -> ranked at best top 10%. Keras , tensorflow , OpenCV , python

    - Hackathon (Hacking Health Camp 2019): Project Manager of ReCord team, Financial Project resources management, scheduling and Tech lead of ReCord IOT prototype.


    - Technical internal communication of Deep learning Image segmentation model technologies

    - Recruitment : 15+ corrected Technical tests and 10+ technical recruiting interviews

    - Business development : Call for tender

    Python TensorFlow keras Data science Machine learning Pandas Scikit-learn Cloud computing computer vision NLP Elasticsearch Statistics Big SQL

Recommendations

Be the first to recommend Yassine

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • PhD Computational Biology
    Universtié Paris Diderot - Paris 7
    2016
    Computational biology for 3D protein structures prediction and analysis : - Development of new tools for protein structure prediction (ORION, ORION web-server) - Development of new tools for protein membrane orientation/assessment (ANVIL, MAIDEN, OREMPRO) - Study of protein structural features for protein structure prediction and protein features assignment - Protein structural organisation analysis - Molecular dynamics for protein structure analysis - Effect on alternative splicing on protein 3D structure Supervision of trainees : Maël Sigaroudi, 6 months internship (Master student) Pierre-Edouard Guerin, 3 months internship (Master student) Doha Naga, 3 months internship (Master student) Agathe Delaune, 1 month internship (Master student) Teaching assistant : 198 teaching hours for Master Students - Bioinformatics (sequence similarity analysis, sequence alignment algorithms, trees computation) - Biostatistics (Statistical tests, P-value computation ) - Programming (R, python and UNIX for beginners) - Master 2 projects jury board (x5) Acquired skills: - Strong teamwork collaboration - International scientific communication (oral and writing) - Protein structure expertise - Litterature mining - Big data managment - Machine learning - Statistical analysis - Data mining - Programmation skills - Web-design - Teaching competences - Student coaching, managment and supervising

Certifications

Skill set

Categories