Hello, my name is

Abdelghaffour Mouhsine

Data Scientist & Python and AI Developer

About me

I am a Data Scientist with expertise in Web Scraping, Web Automation, Natural Language Processing (NLP), and Computer Vision. I have a strong track record of delivering quality work and ensuring client satisfaction.

What I do

From understanding your requirements to designing a blueprint and delivering the final product, I handle everything in between.

NLP & LLM

Leveraging advanced language models and natural language processing techniques to build intelligent text-based solutions.

Computer Vision

Applying state-of-the-art algorithms to extract insights from images and videos, including object detection and image processing tasks.

Web Scraping & Data Collection

Utilizing tools like BeautifulSoup, Scrapy, Selenium and Selenium Grid to extract valuable data from websites, even those with complex JavaScript rendering.

Web Development

Developing full-stack solutions with a focus on Django, Python, React, and Angular, ensuring a seamless and user-friendly experience.

My Experience

08/2023 - Today

KPO BUSINESS

Data Scientist: Automation and AI for Lead Generation

As part of a project with KPO BUSINESS, I designed and developed a multi-stage automated B2B prospecting solution that combines web scraping, artificial intelligence, and automation.
1- Fundraising Data Scraping:
Scraped a list of 1300 startup fundraising events in France from a website using Selenium.
2- Contact Information Extraction:
+ Scraped contact details of each startup directly from their websites. Used a BERT model with 99% accuracy to classify contact pages.
+ Employed the OpenAI API to extract relevant contact information from the identified pages.
3- LinkedIn Search Automation:
Automated Google searches to locate the LinkedIn pages of the startups. Scraped company and employee information from LinkedIn. Identified key profiles (founders, co-founders, CEO, CTO, HR Director).

08/2024 - 10/2024

Mham-الخليج مهام

Freelance Data Scientist: Scraping millions of products and removing logos from product images

+ Scraped 1 million products along with their images.
+ Used Selenium and Selenium Grid to scrape in parallel across 100 threads.
+ Used YOLOv8 to automatically detect and remove logos from the images.
+ Used 100 proxies to avoid IP blocking, allowing me to send 4 million requests without any issues.
+ Ran on AWS cloud infrastructure, with an EC2 instance for both scraping and image processing, and S3 for image storage.
+ Used Docker to containerize the entire project, ensuring portability and scalability.

01/2024 - 06/2024

Imperium

Data Scientist: NLP and LLMs for Automating Contact Information Web Scraping

+ Scraped thousands of websites, including static sites and those with JavaScript rendering, using BeautifulSoup, Scrapy, and Selenium.
+ Implemented parallel scraping with Selenium Grid and threading to speed up the process.
+ Integrated transformer-based models (BERT with 99% accuracy) and LLMs (Llama 3) to improve the accuracy of contact information extraction from websites.

Skills

Machine Learning & Deep Learning:

KNN, K-means, ANN, CNN, GANs, Transformers

Web Scraping & Automation:

BeautifulSoup, Scrapy, Selenium, Selenium Grid

Computer Vision & NLP:

OpenCV, YOLO, OCR, spaCy, Transformers

LLMs:

OpenAI API, Open-Source LLMs, LangChain, Ollama

DevOps & Cloud:

AWS, Docker, Git, GitHub, GitLab, Linux, Maven

Programming Languages:

Python, Java, JavaScript, TypeScript

Web Development:

Back-end: Django, Spring Boot, Hibernate, JPA, JEE, JUnit 5, Mockito, PHP & Laravel
Front-end: Angular, JavaScript, jQuery, Bootstrap, HTML & CSS
CMS: WordPress

Certifications