Hello, my name is
Abdelghaffour Mouhsine
Data Scientist & Python and AI Developer
- abdelghaffourm@gmail.com
- +212 682 103381
- @AbdelghaffourMouhsine
- @abdelghaffour-mouhsine

About me
I am a Data Scientist with expertise in Web Scraping, Web Automation, Natural Language Processing (NLP), and Computer Vision... I have a strong track record of delivering quality work and ensuring client satisfaction.
What I do
From understanding your requirements and designing a blueprint to delivering the final product, I handle everything in between.
NLP & LLM
Leveraging advanced language models and natural language processing techniques to build intelligent text-based solutions.
Computer Vision
Applying state-of-the-art algorithms to extract insights from images and videos, including object detection and image processing tasks.
Web Scraping & Data Collection
Utilizing tools like BeautifulSoup, Scrapy, Selenium and Selenium Grid to extract valuable data from websites, even those with complex JavaScript rendering.
Web Development
Developing full-stack solutions with a focus on Django, Python, React, and Angular, ensuring a seamless and user-friendly experience.
My Experience
08/2023 - Present
KPO BUSINESS
Data Scientist: Automation and AI for Lead Generation
As part of a project with KPO BUSINESS, I designed and developed an automated B2B prospecting solution in multiple stages, utilizing scraping technologies, artificial intelligence, and automation.
1- Fundraising Data Scraping:
+ Scraped a list of 1300 startup fundraising events in France from a website using Selenium.
2- Contact Information Extraction:
+ Scraped contact details of each startup directly from their websites.
+ Used a BERT model with 99% accuracy to classify contact pages.
+ Employed the OpenAI API to extract relevant contact information from the identified pages.
3- LinkedIn Search Automation:
+ Automated Google searches to locate the LinkedIn pages of the startups.
+ Scraped company and employee information from LinkedIn.
+ Identified key profiles (founders, co-founders, CEO, CTO, HR Director).
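As an illustration of the key-profile step above, here is a minimal sketch of a job-title filter. The `title` field, the role names, and the keyword patterns are assumptions made for this example; the matching rules actually used in the project are not stated here.

```python
import re

# Hypothetical role patterns (assumed for illustration, not the project's real rules).
KEY_ROLE_PATTERNS = {
    "founder": re.compile(r"\b(co[- ]?)?founder\b", re.IGNORECASE),
    "ceo": re.compile(r"\b(ceo|chief executive officer)\b", re.IGNORECASE),
    "cto": re.compile(r"\b(cto|chief technology officer)\b", re.IGNORECASE),
    "hr_director": re.compile(
        r"\b(hr director|head of hr|director of human resources)\b", re.IGNORECASE
    ),
}


def classify_title(title):
    """Return the key roles matched by a job title (empty list if none match)."""
    return [role for role, pattern in KEY_ROLE_PATTERNS.items() if pattern.search(title)]


def find_key_profiles(employees):
    """Keep only employees whose title matches at least one key role."""
    key_profiles = []
    for employee in employees:
        roles = classify_title(employee.get("title", ""))
        if roles:
            # Copy the record and attach the matched roles.
            key_profiles.append({**employee, "roles": roles})
    return key_profiles
```

A real pipeline would likely need French-language titles and fuzzier matching as well; this only shows the shape of the filtering step.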
08/2024 - 10/2024
Mham-الخليج مهام
Freelance Data Scientist: Scraping Millions of Products and Removing Logos from Product Images
+ Scraped 1 million products along with their images.
+ Scraped in parallel across 100 threads with Selenium and Selenium Grid.
+ Detected and removed logos from the images automatically with YOLOv8.
+ Routed traffic through 100 proxies to avoid IP blocking, sending 4 million requests without any issues.
+ Ran on AWS cloud infrastructure, with an EC2 instance for both scraping and image processing and S3 for image storage.
+ Containerized the entire project with Docker, ensuring portability and scalability.
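The combination of threads and proxies described above can be sketched roughly as follows. This is a simplified illustration, not the project's code: it replaces Selenium Grid with a generic `fetch(url, proxy)` callable supplied by the caller, and the round-robin proxy assignment and the function names are assumptions.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import cycle


def assign_proxies(urls, proxies):
    """Pair each URL with a proxy in round-robin order (assumed strategy)."""
    if not proxies:
        raise ValueError("at least one proxy is required")
    proxy_cycle = cycle(proxies)
    return [(url, next(proxy_cycle)) for url in urls]


def scrape_all(urls, proxies, fetch, max_workers=100):
    """Call fetch(url, proxy) for every URL on a thread pool.

    Results are returned in the same order as the input URLs.
    """
    jobs = assign_proxies(urls, proxies)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(lambda job: fetch(*job), jobs))
```

In practice `fetch` would wrap a browser session or HTTP client configured with the given proxy, plus retries and error handling, which are omitted here.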
01/2024 - 06/2024
Imperium
Data Scientist: NLP and LLMs for Automating Contact Information Web Scraping
+ Scraped thousands of websites, including static sites and those with JavaScript rendering, using BeautifulSoup, Scrapy, and Selenium.
+ Implemented parallel processing with Selenium Grid and threading to accelerate the scraping process and improve efficiency.
+ Integrated transformer-based AI models (BERT with 99% accuracy) and LLMs (Llama 3) to enhance the accuracy of extracting contact information from websites.
Skills
Machine Learning & Deep Learning:
KNN, K-means, ANN, CNN, GANs, Transformers
Web Scraping & Automation:
BeautifulSoup, Scrapy, Selenium, Selenium Grid
Computer Vision & NLP:
OpenCV, YOLO, OCR, spaCy, Transformers
LLMs:
OpenAI API, Open-Source LLMs, LangChain, Ollama
DevOps & Cloud:
AWS, Docker, Git, GitHub, GitLab, Linux, Maven
Programming Languages:
Python, Java, JavaScript, TypeScript
Web Development:
Back-end: Django, Spring Boot, Hibernate, JPA, JEE, JUnit 5, Mockito, PHP & Laravel
Front-end: Angular, JavaScript, jQuery, Bootstrap, HTML & CSS
CMS: WordPress