You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Aymen SouidAS

Aymen Souid

Speech and Language Engineer

€125/day
Tunis, TN
3-7 years

Average response time: 1 hour

About Aymen

Hello, I am Aymen, I'm an AI engineer with strong expertise in Language Modeling and ASR systems (Automatic speech recognition)
I have been working in the NLP and ASR fields since 2020 and here are my main skills:

-Fine-tuning ASR models ( Conformer and Whisper)
-Training and deploying large-scale deep learning models
-Implementing tokenization strategies
-Developing efficient web scraping solutions
-Prompt engineering for LLMs
-Integrating n-gram language models with ASR systems
-Large-scale data processing for training and inference
-Specializing in multilingual ASR/LMs development for Arabic, French, and English, leveraging my fluency and expertise in the linguistic nuances of these languages.-Optimized handling of millions of audio files for ASR tasks

For well-defined projects that need expert execution, I’m the right person to bring your vision to life.
  • English

    Native or bilingual

  • French

    Native or bilingual

  • Arabic

    Native or bilingual

Remote only
Primarily works remotely

Experience

  • Cerence Inc
    ASR Engineer
    May 2023 - Today (3 years and 1 month)

    Build Voice Assistant and integrate Language Model to boost accuracy:
    . Train Conformer architecture to build robust ASR system for French, Arabic, Turkish and Hebrew languages.
    . Training models that handle multi-lingual complexities
    . Build tokenizers for rich morphological languages.
    • Experiment with different n-gram orders to find the optimal balance between accuracy and
    computational cost.
    • Use techniques like interpolation, ngram shrinking, pruning, and model interpolation to improve performance. Improve language model scores and perplexity:
    • Employ data augmentation techniques such as backtranslation and noise injection. Minimize word error rate (WER) and sentence error rate (SER):
    • Utilize beam search decoding, lattice rescoring with language models, and confidence estimation.
    • Experiment with different token lexicon patches to improve the accuracy of word recognition. Optimize text-to-speech systems:
    • Use pretrained models to recover diacritics for languages like Arabic. Create statistical tokenizers:
    • Implement techniques like character-based, word-based, and subword tokenization.
    • Consider using OpenNLP tokenizer for languages like Arabic. Resolve client bug tickets:
    • Conduct thorough investigations to identify root causes.
    • Debug code and implement appropriate fixes.
    Implement research paper solutions:
    • Explore neural machine translation models, self-supervised learning techniques, and transfer learning approaches..
  • Inkylab
    Learning Engineer
    February 2021 - May 2023 (2 years and 3 months)
    Tasks:
    • Scraping without being flagged as a bot using rotating proxies approaches
    • Developing scripts and code to automate the data extraction process, and handling data in a way that is accurate by connecting spiders with Django views.
    • Train classificationsNLP model for topic modeling using Pytorch
    • Serve Machine Learning models to production using TorchScript and TorchServe
    • Use the GPT3 model for data collection and augmentation to generate meaningful queries from random
    keywords.
    • Dockerise microservices cloud platform Cloud Run service.
    • custom NLP models on Vertex AI (Batch prediction service)
    • Automate deployment process using terraform .
    • Design and configure database for the backend part using Django Google
  • Inkyfada
    Coach and Speaker
    January 2024 - January 2024
    Media loves tech as a Media loves tech is a competition where teams come up with an innovative ideas and try to realize, founded DW Akademie and Al Khatt: https://www.facebook.com/MediaLovesTech/ Information as I participated to brainstorm about a prototype to design a fake news detection application in the MENA region and especially about language specific tools for knowledge production and data analysis.

Recommendations

Be the first to recommend Aymen

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Baccalaureate
    Faculty of Sciences of Tunis, University of Tunis El Manar
    2022
    Baccalaureate
  • Preparatory Classes Diploma
    Tunis Preparatory Engineering Institute
    2019
    Preparatory Classes Diploma

Skill set

Categories