Back to Home
Omar ARIF
Data Scientist / Machine Learning Engineer
About Me
I am a computer engineer specializing in Artificial Intelligence and Data Science, with particular expertise in Deep Learning, Natural Language Processing (NLP), and Computer Vision. My background includes experiences at renowned companies as well as innovative freelance projects.
Passionate about AI techniques and advancements in the field, I explore state-of-the-art methods while developing practical applications. I am committed to deepening my knowledge through reading scientific papers and implementing real-world solutions. Additionally, I have gained solid experience in data engineering, as well as in the production and monitoring of Machine Learning systems, allowing me to effectively solve complex problems.
- Languages: French (fluent), English (fluent, IELTS 8.0), Arabic (fluent)
- Soft Skills: Independence, Curiosity, Team Spirit, Analytical Mindset, Creativity, Ability to Work Under Pressure
Professional Experience
Cdiscount - Data Scientist (Intern)
February 2022 - July 2022
- Project: Automation of product matching between the Cdiscount catalog and competitor sites, using textual (NLP) and visual (Computer Vision) data.
- Key Steps:
- Review of state-of-the-art product matching techniques.
- Collection, analysis, and processing of data from product descriptions and images.
- Development and training of Deep Learning models.
- Optimization and production deployment of the final model.
- Use of Snowflake for processing large datasets.
- Technical Stack: Python, Pytorch, SQL, Snowflake, Pandas, Transformers, Docker, Kubernetes, Power BI.
- Results: Improved product matching rate with a significant reduction in manual efforts.
Digital Development Agency - Python Developer (Intern)
August 2022 - October 2022
- Project: Creation of a scraping API to collect and analyze scientific publications from various websites.
- Responsibilities:
- Design of the API architecture in FastAPI.
- Data collection, cleaning, and creation of dashboards to visualize trends.
- Deployment and performance optimization of the API.
- Technical Stack: Python, FastAPI, BeautifulSoup, HTML, Power BI, Docker, Git.
- Results: Robust and scalable API capable of processing large amounts of data with effective monitoring.
Freelance - AI Developer
2023 - Now
Examples of Projects:
- Development of a voice- and text-based AI agent for e-commerce websites : Built an intelligent agent using Agentic RAG to interact with customers, recommend personalized products, and manage their shopping cart.
- Toxic Content Detection API: Using Machine Learning models to automatically identify toxic messages or images for moderation.
- Super-Resolution Application: Enhancing the quality of blurred areas in surveillance videos using Computer Vision techniques.
- Automated data extraction from scanned invoice images: Developed a system based on lightweight, fine-tuned neural networks to extract key fields from scanned invoices, generating a structured XML file according to company specifications and eliminating manual data entry.
- Real-time dashboard for financial market data: Built an interactive platform to monitor asset prices in real time, visualize technical indicators (moving averages, Bollinger Bands, RSI, etc.), and perform sentiment analysis based on social media discussions.
- Movie Recommendation API: Developing a system suggesting films based on semantic similarity (NLP) (using LLMs) between movie plots in a database and the user’s prompt.
Outlier AI - Coding Expert for AI Training (Freelance)
September 2024 - Now
Responsibilities:
- Optimizing large language models to improve code generation and function-calling capabilities using Reinforcement Learning from Human Feedback (RLHF).
- Providing feedback and ranking AI-generated code, primarily in Python and C++, to enhance model accuracy and real-world applicability.
- Designing and testing prompts to maximize model effectiveness in generating accurate and functional code outputs.
- Supporting the advancement of generative AI performance across diverse, flexible projects within a remote work environment.
GitHub Projects
Here are some projects available on my GitHub:
- Semantic Movie Recommender: A movie recommendation system based on the semantic similarity of plots, using FastAPI and Sentence Transformers. Try it
- AI Cover Letter Generator: A tool that generates personalized cover letters from CVs and job postings, supporting DOCX and PDF formats. Try it
- Day-Night Image Conversion GAN: A project converting day images to night images (and vice versa) using conditional neural networks, with data from the Carla simulator.
- Genetic AI Snake Player: An intelligent Snake player (Deep Learning), trained via a genetic algorithm to optimize the neural network.
- RL-DQN Prioritized Experience Replay: Reinforcement Learning implementation of a Deep Q-Network with Prioritized Experience Replay on the highway-env environment from OpenAI.
Technical Skills
- Languages: Python, Java, C, C++, SQL, Bash, HTML/CSS, x86 Assembly
- Frameworks and Libraries: Pytorch, Tensorflow/Keras, Scikit-learn, Pandas, OpenCV, Transformers
- Tools: Docker, Kubernetes, Git, Power BI, Snowflake, FastAPI, BeautifulSoup
- Methods: Deep Learning, NLP, Computer Vision, Machine Learning, Data Engineering, Recommendation Systems
Education
AIVANCITY (Paris, France) - MSc in Data Engineering and Cloud Computing (M2)
2025–2026
ENSEIRB-MATMECA (BORDEAUX-INP) - Computer Engineering
2019 - 2023
Specialty: Artificial Intelligence
CPGE Moulay Youssef (Rabat, Morocco) - Preparatory Classes for Grandes Écoles
2016 - 2019
Field: Mathematics-Physics (MP)
Lycée Assabil (Rabat, Morocco) - Baccalauréat in Mathematical Sciences
2015 - 2016
Certificates
- Introduction to Machine Learning in Production - Coursera
Certification