Sofian Chaybouti

Machine Learning Engineer

€350/day
4 projects
Paris, FR
3-7 years

Average response time: 1 hour

About Sofian

Engineering graduate of ENSTA Paris and of the MVA Master's program at ENS Paris-Saclay.
I am trained and experienced in machine learning, deep learning, NLP, and computer vision.
I am currently a research engineer at Noah's Ark Lab, where I work on research projects in deep learning, reinforcement learning, bandits, and optimization.
  • French: native or bilingual
  • English: fluent

Remote only
Primarily works remotely

Experience

  • Huawei
    Research Engineer
    TELECOMMUNICATIONS
    August 2021 - Today (4 years and 10 months)
    Boulogne-Billancourt, France
    Noah's Ark Lab
  • Crédit Agricole SA
    Data Scientist
    BANKING AND INSURANCE
    November 2020 - July 2021 (8 months)
    Montrouge, France
    DataLab, Team Semantica.

    - Aspect-based sentiment analysis of client feedback:
    The goal of this project was to improve the production model by using transformer models.
    A multitask model that performs both aspect detection and polarity detection was implemented and deployed to production.

    - Financial contract analysis for the investment bank CACIB:
    The goal of the project was to build a search engine over the contracts database.
    The contracts are in PDF format and have to be converted to images, processed with OCR, and indexed in Elasticsearch.
    The search engine has several features:
      - Segmentation of the contract into paragraphs, lists, and clauses.
      - Clause classification using tf-idf approaches, to extract specific clauses of interest.
      - Extraction of relevant spans that help determine whether the contract is transferable, using techniques inspired by neural question answering and semantic similarity.

    - Training of multimodal deep learning classification models for car insurance contracts, using textual (OCR) and visual content.

    - Training of deep learning information extraction models for car insurance contracts, using semantic segmentation and textual embeddings.


    Technology stack: PyTorch, TensorFlow, Transformers, Tesseract, Elasticsearch, MLflow, GitLab CI/CD, Poetry, Docker, AWS S3, etc.
    NLP Computer Vision Python Pytorch TensorFlow
  • Crédit Agricole SA
    NLP research intern
    BANKING AND INSURANCE
    April 2020 - October 2020 (6 months)
    Montrouge, France
    DataLab, Team Semantica.

    Designed a French textual search engine with a span extraction module.
    Achieved state-of-the-art results on the Phrase-Indexed QA (PIQA) benchmark.

    Achieved state-of-the-art results on the SQuAD-open benchmark.
    (preprint: https://arxiv.org/abs/2012.09766)
    NLP Pytorch Python Research Deep Learning Transfer Learning Multitask Learning Elasticsearch

Education

  • Master MVA
    École Normale Supérieure Paris-Saclay
    2020
  • Engineering degree (Diplôme d'ingénieur)
    ENSTA Paris
    2020

Skill set (12)
