← Back to list
Middle
Registration: 30.09.2025

Momin Qureshi

Specialization: Founding GenAI Engineer
— Results-driven Fullstack Developer with a strong foundation of 5+ years in software engineering, including Python, JavaScript, C++, and modern web framework (React, Next.js, Django, FastAPI, Flask). — Experienced in leading engineering teams (up to 12 members) to achieve significant performance gains. — Passionate about clean architecture and rapid prototyping to the development of GenAI solutions. — Proficient with cloud platforms and container orchestration, consistently delivering robust and scalable systems ready for AI integration. — Experienced with fine-tuning and evaluation of LLMs. Awards: — 2nd Runner-Up, ACM ICPC Regional (2018). — Winner, FAST ACM GeekWeek (2020). — Winner, COMSATS Code-Red (2019). — Multiple podiums at ACM, IEEE, and inter-varsity coding contests (2017-2019).
— Results-driven Fullstack Developer with a strong foundation of 5+ years in software engineering, including Python, JavaScript, C++, and modern web framework (React, Next.js, Django, FastAPI, Flask). — Experienced in leading engineering teams (up to 12 members) to achieve significant performance gains. — Passionate about clean architecture and rapid prototyping to the development of GenAI solutions. — Proficient with cloud platforms and container orchestration, consistently delivering robust and scalable systems ready for AI integration. — Experienced with fine-tuning and evaluation of LLMs. Awards: — 2nd Runner-Up, ACM ICPC Regional (2018). — Winner, FAST ACM GeekWeek (2020). — Winner, COMSATS Code-Red (2019). — Multiple podiums at ACM, IEEE, and inter-varsity coding contests (2017-2019).

Portfolio

Veritas- Android E-Book Reader

• Built MVP in 90 days; ran UX study with 20 pilot users. • Added AI-based book recommendations increasing reading session length 20%. • Stack: Kotlin, Firebase, GCP

Movie Recommendation System

• Designed & built a chat-based LLM movie recommender using Python, Hugging Face Transformers, Mistral-7B, LoRA/PEFT, and SQLite (FTS5 + vector blobs); ingested >1 M TMDb + IMDb titles with plots, genres, and embeddings for low-latency semantic search and sophisticated multi-filtered recommendations. • Exposed the catalogue through an Mistral-based function-calling API with FastAPI, letting the model auto-fill JSON queries and rank candidates. • Containerised with Docker and deployed inference on GroqCloud, streaming >3 k tokens/s; added token-level logging, cost monitoring, and CI/CD scripts for automated model refreshes.

AI Image Colorization Tool

• Co-developed model colorizing 100 high-res landscape images per minute with 30% higher SSIM than baseline. • Stack: TensorFlow, OpenCV

Phonon- Cross-Platform Image Viewer

• Developed high-performance viewer (<200 ms cold start, <50 MB RAM) for Win/Linux/macOS. • Adopted by friends for design workflows; praised for speed and minimal UI • Stack: Electron, Node.js

Skills

Python
JavaScript
C++
Generative AI
Language Learning Model (LLM)
Natural Language Processing (NLP)
Pydantic AI
Open AI API
AWS
SST
CI/CD
Go
Rust
SQL
NoSQL
PySpark
Apache Airflow
Scrapy
GCP
Docker
Kubernetes
Terraform
Django
React
Node.js
TensorFlow
Pandas
NumPy
Agile

Work experience

Founding GenAI Engineer
since 07.2025 - Till the present day |Tyce.ai
Pydantic AI, FastAPI, Python, PostgreSQL, AWS Lambda, SST, OpenAI API, Docker, CI/CD
● Contributing as one of three core developers building and scaling Tyce’s AI-powered document assistant. ● Implemented the ”Tyce Agent” using PydanticAI, including fine-tuning agent behavior, designing the agent tools, streaming responses, and optimizing. ● Optimized agentic RAG search by implementing contextual retrieval. ● Implemented custom HTML splitter for more stable and style-aware document chunking. ● Scaled data ingestion pipeline to handle 100,000+ in a few minutes by moving ingestion to AWS Lambda. ● Used advanced techniques, including chunking optimization, parallel processing, and selective processing, to cutdown processing costs and time by 20% - 90%, depending on the document. ● Participating in product roadmap and architecture discussions, helping shape both technical direction and user experience. Projects: 1. Movie Recommendation System — Python, LLM (Mistral AI), FastAPI. ● Designed & built a chat-based LLM movie recommender using Python, Hugging Face Transformers, Mistral-7B, LoRA/PEFT, and SQLite (FTS5 + vector blobs); ingested >1 M TMDb + IMDb titles with plots, genres, and embeddings for low-latency semantic search and sophisticated multi-filtered recommendations. ● Exposed the catalogue through an Mistral-based function-calling API with FastAPI, letting the model auto-fill JSON queries and rank candidates. ● Containerised with Docker and deployed inference on GroqCloud, streaming >3 k tokens/s; added token-level logging, cost monitoring, and CI/CD scripts for automated model refreshes. 2. Veritas - Android E-Book Reader — Kotlin, Firebase, GCP. ● Built MVP in 90 days; ran UX study with 20 pilot users. ● Added AI-based book recommendations increasing reading session length 20%. 3. Phonon - Cross-Platform Image Viewer — Electron, Node.js. ● Developed high-performance viewer (<200 ms cold start, <50 MB RAM) for Win/Linux/macOS. ● Adopted by friends for design workflows; praised for speed and minimal UI. 4. AI Image Colorization Tool — TensorFlow, OpenCV. ● Co-developed model colorizing 100 high-res landscape images per minute with 30% higher SSIM than baseline.
Software Engineer / Team Lead
09.2022 - 06.2025 |Turing
Python, PyTorch, Hugging Face, SQL, Airflow, Docker, Kubernetes, GitHub, GitOps, Terraform, AWS, GCP, Azure, ServiceNow.
● Architected data pipelines that curated and transformed 10 000+ multimodal samples (2M+ tokens) for training state-of-the-art LLMs using the latest training formats (RLHF, SFT, DPO, ORPO). ● Fine-tuned and evaluated code-generation and chat models with Hugging Face Transformers, raising pass-@-1 accuracy to 70% and cutting hallucinations by 18%. ● Built an automated evaluation harness that reduced model validation cycles by 40%. ● Rolled out semi-supervised active-learning workflows and auto-labeling pipelines that slashed manual QA effort by 5x. ● Led a cross-functional team of 12 data engineers, mentoring them on prompt engineering, dataset design, and scalable MLOps best practices. ● Implemented robust PII and toxicity detection in Python & SQL, achieving 99,2% precision and ensuring SOC 2 / GDPR compliance. ● Deployed GenAI services on Kubernetes with GitOps practices using GitHub CI/CD pipelines, streamlining production rollout and rollback. ● Integrated models with external platforms like ServiceNow to enable real-time AI-assisted workflows. ● Provisioned cloud infrastructure on AWS, GCP, and Azure using Infrastructure as Code (IaC) to deploy and manage GenAI training utilities-including evaluation harnesses, data pipelines, and model serving-while maintaining a 99,99% SLA.
Software Engineer
07.2021 - 08.2022 |Arbisoft
Django, React, Scrapy, AWS (EC2, DynamoDB, Redshift, Lambda, Glue), Airflow, GitLab, Kubernetes
● Played a key role in developing and maintaining robust cloud-based ETL pipelines for a leading fashion analytics platform, ensuring a reliable flow of 4 TB+ raw data daily. ● Collaborated with a fully remote team across 3 time zones, leveraging sprint rituals and clear documentation to hit 95% of roadmap milestones on schedule. ● Re-engineered the extraction component of a fashion analytics pipeline (20k+ lines), shrinking daily runtime by 36% and saving $45k per year in compute. ● Built and maintained full-stack features: React dashboards for merchandising teams, backed by Django REST API endpoints and AWS API Gateway. ● Ensured data integrity and performance with automated tests (PyTest + Cypress), reaching 85% coverage and daily CI runs. ● Deployed micro-frontends and Python services via Docker/Kubernetes and Terraform, enabling blue-green releases with <5 min rollback.
Consultant - Data Science & Game Development
01.2019 - 07.2021 |NDA
Python, C++, Rust, Go, JavaScript, C#, Java, Unity, PySpark, AWS, GCP
● Delivered 100+ contracts ranging from predictive analytics to indie games. ● Built real-time analytics platform on GCP handling 60k events/sec; reduced query latency from 2 s to 200 ms. ● Optimized PySpark pipelines, lowering cluster costs 22% through improved partitioning.
Web Developer
06.2019 - 08.2019 |Women In Struggle for Empowerment (WISE)
Python, Django, MySQL, AWS EC2, AWS RDS, cPanel
● Improved API response time 55% and boosted uptime to 99,8% by refactoring Django ORM queries. ● Migrated the website’s hosting and database from shared cPanel to AWS EC2 and Amazon RDS, improving reliability and scalability.

Educational background

Computer Science (Bachelor’s Degree)
2017 - 2021
National University of Computer & Emerging Sciences (FAST - NUCES)

Languages

EnglishNativeUrduNativeHindiNativeJapaneseElementaryPunjabiAdvanced