
helboukkouri/README.md

Hi, I'm Hicham EL BOUKKOURI 👋

I'm a Senior Data Scientist working on NLP, search, and LLM systems, currently at Qwant, where I build and improve web-scale ranking and search pipelines. My background combines applied AI engineering with research in domain adaptation for language models, including publications such as

  • CharacterBERT,
  • Embedding Strategies for Specialized Domains,
  • Re-train or Train from Scratch?, and
  • Specializing Static and Contextual Embeddings in the Medical Domain Using Knowledge Graphs.

You can find more about my work on Google Scholar and my profile on LinkedIn.

Pinned

  1. character-bert (Public)

    Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"

    Python · 199 stars · 47 forks

  2. awesome-nlp-papers (Public)

    A reading list of awesome Natural Language Processing papers, sorted by date.

    35 stars · 4 forks

  3. acl_srw_2019 (Public)

    Code for reproducing the experiments from "Embedding Strategies for Specialized Domains: Application to Clinical Entity Recognition" (El Boukkouri et al.)

    Python · 7 stars · 2 forks

  4. mesh-embeddings (Public)

    Code and pre-trained representations for the Medical Subject Headings (MeSH) thesaurus, computed using node2vec.

    10 stars