AI Engineer at Genesis Group Inc. / Genesis AI Garage

Mostafa Rafiur Wasib

Machine Learning Engineer | LLM Systems, RAG, Evidence Extraction & MLOps

I build production-focused AI and ML systems across LLM workflows, document processing, structured extraction, tabular ML, monitoring, and cloud-backed deployment. I currently work as an AI Engineer at Genesis AI Garage, building practical AI/ML solutions for startup clients and founders.

St. John's, NL, Canada | mostafa.soumik73@gmail.com | 709-219-4278

LLM / RAG Systems Document & Evidence Extraction Tabular ML & Forecasting Monitoring & MLOps Cloud AI Deployment

About

Production-focused AI and ML engineering

I am an AI Engineer and AWS-certified AI professional with experience across machine learning, data analytics, applied research, and client-facing technical delivery. My current work focuses on building AI/ML systems for startup clients and founders, including RAG platforms, document processing, structured evidence extraction, tabular ML, anomaly detection, forecasting, APIs, dashboards, cloud deployment, monitoring, and technical handover.

Because most recent client work involves proprietary data and private production code, I present those projects as professional case studies focused on the problem, my contribution, tools, and high-level outcome.

Role Fit

Built for AI, ML, data science, and production delivery roles

My work sits at the intersection of model development, data engineering, LLM workflows, cloud deployment, monitoring, and client-facing technical delivery.

AI Engineer

RAG systems, document intelligence, structured outputs, APIs, cloud-backed deployment, and production handover.

Machine Learning Engineer

Tabular ML, feature engineering, forecasting, anomaly detection, offline experiments, evaluation, and monitoring workflows.

Data Scientist

SQL/Python analysis, KPI dashboards, model evaluation, experiment summaries, data validation, and stakeholder-ready reporting.

LLM / RAG Engineer

Ingestion, chunking, embeddings, semantic retrieval, metadata filtering, citation-grounded responses, and evidence extraction.

MLOps Engineer

FastAPI, Docker, CI/CD, logging, Grafana dashboards, drift checks, alerts, runbooks, and cloud deployment planning.

Skills

Technical toolkit for applied AI systems

Machine Learning

  • Tabular classification
  • Feature engineering
  • Offline experimentation
  • Model evaluation
  • Anomaly detection
  • Forecasting
  • Recommendation systems
  • XGBoost
  • Gradient Boosting
  • Random Forest
  • scikit-learn
  • PyTorch
  • TensorFlow/Keras

LLM / GenAI

  • RAG
  • Structured extraction
  • Evidence extraction
  • Prompt engineering
  • LangGraph
  • LangChain
  • OpenAI API
  • Anthropic Claude
  • AWS Bedrock
  • Citation-grounded responses

Document & Unstructured Data

  • PDF ingestion
  • Text parsing
  • OCR/image-to-text workflows
  • Chunking
  • Metadata extraction
  • Embeddings
  • Semantic search
  • Vector databases
  • Structured output generation

MLOps / Production

  • FastAPI
  • Docker
  • GitHub Actions
  • CI/CD
  • MLflow-style tracking
  • Logging
  • Model monitoring
  • Drift checks
  • Grafana dashboards
  • Alerts
  • Runbooks
  • Cloud deployment

Cloud & Data

  • AWS
  • Azure
  • GCP
  • S3
  • Athena
  • Lambda
  • EC2
  • IAM
  • CloudWatch
  • Glue
  • Lake Formation
  • PostgreSQL/RDS
  • Cloud Run
  • Cloud SQL
  • pgvector
  • SQL
  • REST APIs
  • ETL/ELT

Developer Tools

  • Cursor
  • Claude Code
  • GitHub Copilot
  • Debugging
  • Code review
  • Test generation
  • Documentation

Production AI Work

Production AI / Client-Facing Work

Most of my recent Genesis AI Garage work involves proprietary client data and private production code. The case studies below summarize the problem, my contribution, tools used, and outcome at a high level without exposing private data, source code, credentials, or client-specific implementation details.

Nditive

Environmental Forecasting & Anomaly Detection

Environmental / industrial sensor analytics

Problem
Build forecasting and anomaly-detection workflows for operational sensor data.
My contribution
Worked on CO2/environmental forecasting, feature engineering, anomaly detection, thresholding, offline evaluation, dashboard reporting, and monitoring handover.
Outcome
Built practical ML workflows and monitoring views to support operational insight and reliability.
  • Python
  • scikit-learn
  • XGBoost
  • Gradient Boosting
  • Random Forest
  • LSTM/autoencoder approaches
  • AWS Athena
  • S3
  • Grafana

Public summary only. Proprietary data and production code are not publicly shared.

Weeva

Document Intelligence & RAG Workflow

Document-heavy business workflows

Problem
Help users search unstructured documents and convert relevant information into structured, usable outputs.
My contribution
Built RAG workflows for document ingestion, parsing, chunking, embedding generation, semantic retrieval, metadata filtering, and structured LLM responses.
Outcome
Converted complex unstructured information into searchable, citation-grounded outputs for client decision-making.
  • Python
  • FastAPI
  • LangGraph/LangChain
  • OpenAI
  • Anthropic Claude
  • AWS Bedrock
  • PostgreSQL/pgvector
  • AWS Lambda
  • S3
  • Docker

Public summary only. Proprietary data and production code are not publicly shared.

StarLuv

Agentic Recommendation & Ranking Workflow

Retail / product discovery

Problem
Improve product discovery and recommendation reliability over a large product catalog.
My contribution
Tested and improved retrieve-rank-reason workflows, product metadata handling, ranking logic, behavior-learning support, and frontend-facing demo outputs.
Outcome
Validated the workflow on an 18,000+ product catalog and a 24/24 passing live test panel before demo and rollout planning.
  • Python
  • LangGraph-style workflows
  • LLM APIs
  • Retrieval/ranking logic
  • Product metadata
  • Evaluation checks

Implementation details are kept high level.

Artinus

Artinus B2B AI/Data Analytics Workflow

B2B AI and data analytics

Problem
Help business users combine information from multiple data sources and convert it into useful insights.
My contribution
Supported AI workflow design, retrieval logic, structured outputs, monitoring documentation, and production-readiness planning.
Outcome
Supported integration-ready AI/data analytics workflows for business insight generation.
  • Python
  • SQL
  • Cloud services
  • Retrieval workflows
  • Structured output patterns
  • Documentation
  • Monitoring plans

Public summary only. Proprietary data and production code are not publicly shared.

Nucliq

Healthcare / Biotech Knowledge Assistant

Healthcare / biotech AI

Problem
Improve access to complex domain knowledge while keeping retrieval structured and maintainable.
My contribution
Supported RAG and AI infrastructure workflows involving secure retrieval design, document processing, structured outputs, logging, and deployment planning.
Outcome
Helped make complex knowledge easier to retrieve, summarize, and use in client workflows.
  • Python
  • FastAPI
  • Vector search
  • Cloud services
  • Docker
  • Logging
  • Monitoring workflows

Public summary only. Proprietary data and production code are not publicly shared.

Soma Health

Healthcare AI Workflow & Documentation Platform

Healthcare AI / clinical workflow support

Problem
Support AI workflows involving document/report generation, privacy-aware deployment planning, and secure system design.
My contribution
Helped with architecture planning, logging decisions, deployment review, cost-control thinking, and documentation for maintainable AI infrastructure.
Outcome
Supported a safer and more maintainable AI workflow for healthcare-related use cases.
  • Python
  • Cloud services
  • AWS planning
  • Logging
  • Monitoring
  • Deployment documentation

Public summary only. Proprietary data and production code are not publicly shared.

DuaTask

Cloud Deployment & Production Readiness

Business operations / cloud deployment

Problem
Improve deployment readiness, health checks, and handover for an AI-enabled platform.
My contribution
Supported codebase review, Azure deployment planning, startup/health-check verification, documentation, and production-readiness review.
Outcome
Helped improve system readiness for deployment and future maintenance.
  • Azure
  • Python
  • Deployment scripts
  • Health checks
  • Documentation
  • Monitoring planning

Details are kept high level.

Audyse

Audio AI / DSP Feasibility

Audio AI for noisy environments

Problem
Explore AI and DSP approaches for denoising and sound-event detection under real-time constraints.
My contribution
Researched denoising approaches, emergency sound detection ideas, DSP vs AI tradeoffs, latency constraints, and model feasibility.
Outcome
Helped clarify practical implementation tradeoffs for future product development.
  • Python
  • Audio ML research
  • DSP/AI feasibility analysis
  • Model optimization review

Details are kept high level.

OmaScan

3D Assessment & Scan Intelligence

Accessibility / 3D home assessment workflows

Problem
Explore AI approaches for interpreting scan-related context, structured outputs, and assessment workflows.
My contribution
Supported feasibility research around GLB/scan context, structured JSON understanding, object-detection targets, render-and-lift approaches, and rule-based/LLM-assisted workflows.
Outcome
Helped evaluate practical AI directions for future product development.
  • Python
  • 3D/vision research
  • Structured data processing
  • LLM workflow planning

Public summary only. Proprietary data and production code are not publicly shared.

Experience

Professional timeline

Jan 2025 - Present

AI Engineer

Genesis Group Inc. / Genesis AI Garage | St. John's, NL

Building production-focused AI and ML systems for startup clients and founders, including RAG platforms, document processing, structured extraction, tabular ML, anomaly detection, forecasting, APIs, dashboards, monitoring, and handover. Supporting team-lead responsibilities across AI Garage projects, including coordinating technical work, guiding junior contributors, and aligning delivery with client needs.

Sep 2024 - Dec 2024

AI Project Intern

Genesis Group Inc. / Genesis AI Garage | St. John's, NL

Worked on anomaly detection, forecasting, model evaluation, AWS Athena/Grafana reporting, deployment notes, and monitoring documentation for client-facing AI workflows.

Jun 2024 - Aug 2024

Data Analyst Intern

Stella's Circle & Choices for Youth | St. John's, NL

Cleaned and transformed datasets, prepared integration-ready JSON outputs, built KPI dashboards, and improved reporting quality through validation and documentation.

Feb 2024 - Apr 2024

Community Researcher Intern

Canadian Council on Rehabilitation and Work | St. John's, NL

Collected and analyzed employment-access data and prepared accessibility-focused research summaries and recommendations.

Oct 2023 - Dec 2023

Research Assistant

Visual and Analytic Computing Lab, Memorial University of Newfoundland | St. John's, NL

Supported applied AI and data research through preprocessing, model experimentation, visualization, image/medical-data analysis, and documentation.

Jan 2022 - Jul 2023

IT Admin & Data Analyst

Vision One Consultancy | Phnom Penh, Cambodia / Remote

Built dashboards, automated ETL workflows, improved data quality, analyzed user logs, and supported reporting and data-access workflows.

Sep 2019 - Feb 2020

Data Science Intern

Maxis Berhad | Kuala Lumpur, Malaysia

Performed data preparation, outlier detection, exploratory analysis, dashboard development, and map/API demo workflow prototyping.

GitHub Work

Curated public repositories

My GitHub is organized as a technical evidence layer for current portfolio work, applied AI/ML systems, academic projects, and supporting experiments. The repositories highlighted here are the most relevant to AI Engineer, Machine Learning Engineer, Data Scientist, LLM/RAG, and MLOps roles.

Portfolio

AI/ML Engineering Portfolio

Static GitHub Pages portfolio with production AI case studies, CV, public project map, and professional positioning.

  • HTML
  • CSS
  • GitHub Pages
View repository

ML / Monitoring

Gas Sensor Classification & Anomaly Detection

Public repo aligned with sensor analytics, anomaly detection, forecasting, and operational monitoring themes.

  • Jupyter Notebook
  • Anomaly Detection
  • Sensor Data
View repository

Computer Vision

Multimodal Emotion Recognition System

Applied emotion recognition project useful for showing computer vision, multimodal AI, and HCI experience.

  • Python
  • Computer Vision
  • HCI
View repository

Accessibility AI

AI Accessibility Extension

Visual-to-audio accessibility project with image captioning and assistive workflow context.

  • Accessibility
  • Image Captioning
  • Assistive AI
View repository

Bio / Ecology ML

Fungal Habitat Prediction

Machine learning project for habitat prediction using taxonomy and observation data.

  • Jupyter Notebook
  • Classification
  • Ecology Data
View repository

NLP / ML Tools

Text Summarization & Sentiment Analysis

Compact NLP projects that show preprocessing, classical ML, and text analytics foundations.

  • Python
  • NLP
  • scikit-learn
Summarization repo Sentiment repo

Public Projects

Selected Academic & Public Projects

These projects support the production case studies above and keep the strongest academic and public work easy to scan. Each card links to the local project detail page and the matching GitHub repository where available.

CO2 Emissions Forecasting

Compact public summary of forecasting and anomaly-detection work; detailed production framing appears above.

Python, AWS Athena, Grafana, XGBoost, Random Forest

Anomaly Detection Model

Sensor-data anomaly detection exploration with thresholding and model-evaluation context.

Python, PyTorch, LSTM autoencoder, evaluation metrics

Real-Time Face, Gender, and Emotion Detection

Computer-vision prototype for real-time face, age, gender, and emotion analysis.

Python, OpenCV, Caffe, Keras

Fungal Habitat Prediction

Ecological ML project predicting fungal habitat classes from taxonomy and observation data.

Python, NumPy, scikit-learn, Random Forest, LightGBM

DNA Sequence Classification

Bioinformatics classification project comparing sequence encodings for DNA data.

Python, scikit-learn, Gradient Boosting, k-mer encoding

Automated Facemask Detection

Image-processing project exploring mask detection in public-space imagery.

MATLAB, image processing, Viola-Jones, HSV segmentation

Text Summarization Tool

Extractive text summarization project for ranking and selecting important sentences.

Python, NLTK

Sentiment Analysis Tool

Text classification workflow for sentiment analysis with preprocessing and model training.

Python, scikit-learn, NLP preprocessing

Image Classification with Neural Networks

Neural-network image classification project using classic academic datasets.

Python, TensorFlow, neural networks

MATLAB Image Processing

Image-processing exercises covering edge detection, filtering, and visual analysis.

MATLAB, edge detection, filtering

Real-Time Visual-to-Audio Accessibility Tool

Accessibility prototype converting visual scene context into audio support.

Python, TensorFlow, OpenCV

Emotion Recognition for HCI

Human-computer interaction project exploring emotion recognition from multimodal cues.

Python, OpenCV, TensorFlow

Education

Academic background

Master of Artificial Intelligence

Memorial University of Newfoundland, Canada

Bachelor of Computer Science, Artificial Intelligence

University of Malaya, Malaysia

Certifications

Licenses and certifications

Fidelity Investments - Customer Service Job Simulation

Forage

Credential ID: 7qx54poyG85XB5h2H

AI For Everyone

Coursera

Credential ID: BNHHR2YN693L

Getting Started with AWS Machine Learning

Coursera

Credential ID: UVRGD9SJTMF2

Introduction to Data Analytics for Business

Coursera

Credential ID: ZWM3N6TKGKR9

AWS Fundamentals: Going Cloud-Native

Coursera

Credential ID: FLA8XC8UE6H5

Leadership Gold

Abundant Impact

Credential ID: 55684331599925

Contact

Open to AI/ML engineering conversations

Based in St. John's, NL, Canada and open to remote Canada AI/ML roles.

mostafa.soumik73@gmail.com LinkedIn GitHub Download CV Phone: 709-219-4278