Senior AI Software Engineer with 4+ years of experience building scalable backend systems and deploying applied AI in production environments, including real-time systems serving tens of thousands of users. Strong background in FastAPI backends, real-time systems, and LLM-powered workflows. Experienced in cloud-native architectures (AWS, Docker, Kubernetes) and self-hosted AI inference. Motivated to work on high-impact, globally distributed engineering teams and open to relocation across Europe.



Designed and implemented OptySleep recommendation engine using Redis caching and A/B testing, increasing iOS app engagement by ~70% and improving retention and subscription renewals.
Delivered production-grade IoT backend for Staging Pro using Django, PostgreSQL, and Redis, processing live telemetry with <100ms latency for real-time race coordination.
Built PseudoCompiler, a real-time WebSocket-based platform (FastAPI, AWS) scaling to 50,000+ concurrent users with <50ms latency, enabling higher user throughput at reduced infrastructure cost.
Led backend and AI/IoT workstreams for a 3-engineer team, introduced containerized deployments (Docker, Kubernetes), and automated ops workflows with RAG + n8n, saving ~8 hours/week.
Prototyped lightweight, quantized LLMs (PyTorch, Ollama) for secure on-device inference on IoT hardware; currently evaluating FunctionGemma for agentic workflows.
Working on new products — Voca: the AI voice & chat agent with sales CRM, and CarBNB — a car rental app for Gulf and South Asian markets.

Architected hybrid systems (monolithic + microservices) on Azure with Kubernetes, scaling reliably to 100k-1M+ users.
Built ViralMe AI Video Editor (viralme.today), using Remotion, OpenCV, and FFMPEG reducing turnaround from hours to minutes.
Deployed self-hosted LLMs and Whisper3 via Ollama CLI, optimized GPU usage for cheaper inference.
Developed multi-agent video pipelines (YOLOv12, DeepSORT, SAM2 + RabbitMQ orchestration + LLM agents) to enable async detection, tracking and semantic summarization — achieved near-real-time throughput for production feeds (~20–25 FPS on target infra) and automated tagging.
Automated workflows and built connectors and background workers that cut manual reconciliation and data sync tasks by ~50%, improving reliability and reporting accuracy projects like Fortify ERP (fortify.biz).
Designed REST APIs & microservices (FastAPI/Django, Postgres, Redis, Firebase) and improved reliability & throughput.
Implemented CI/CD (GitHub Actions + Docker), shortened release cycles ~40%.
Migrated services to cloud infra; built dashboards improving issue detection by ~30%.

Built ML pipelines to support AI-powered breast cancer diagnosis, achieving 92% accuracy and reducing review time by 87%.
Worked in a lab incubation environment tied to FYP research in collaboration with Aga Khan University Hospital.
Developed tools for automated triage, sorting, and tagging of radiology images.

Prepared and analyzed large volumes of breath & cough data for early COVID-19 detection using signal processing.
Achieved accuracy of 89% in early-built prototype.
Developed a prototype of a COVID-19 detection solution using Flask & Ensembling techniques, under the supervision of Stanford University & DetectCovid.
My experience as a Microsoft Student Ambassador has been incredibly rewarding, simultaneously supporting two communities. I relish being involved with both GDSC and MLSA, navigating various technologies with my peers. It's an entirely new kind of adventure for me.
Co-established the student developers community in 2019 at our campus. Led the community 2020 to 2021. Hosted total 150 workshops with 100 as self-trainer in AI. Trained over 1000+ students myself alone.
Selected peer‑reviewed work, conference papers, a published book, and thesis with direct DOI and publisher links.
M. H. Shahbaz, Nadeem Ullah, M. Ahmed
Spectrum of Engineering Sciences ISSN (e)3007-3138 (p)3007-312X vol. 4, no. 5, pp 1724–1734, 2026
Ali, U., Kandhro, I.A., Ahmed R.S., Khan, A.A. Shahbaz, M.H. Osama, M.
International Journal of Emerging Sciences and Digital Economy (IJESDF), vol. 17, no. 3, pp 391-403, April 2025
K. Mahboob, M. H. Shahbaz, F. Ali, and R. Qamar
VFAST Transactions on Software Engineering, vol. 11, no. 2, pp. 249–255, Jun. 2023
M. H. Shahbaz, Zain-Ul-Abidin, K. Mahboob and F. Ali
2023 7th International Multi-Topic ICT Conference (IMTIC), Jamshoro, Pakistan, 2023, pp. 1-7
T. Mubeen, Zain-Ul-Abidin, M. H. Shahbaz, P. O. Roth and M. A. L. Nieto
2023 Global Conference on Wireless and Optical Technologies (GCWOT), Malaga, Spain, 2023, pp. 1-7

Microsoft for Startups · Founders Hub (Transpify)
Awarded $25,000 in Azure Credits by Microsoft for Transpify startup under Founders Hub.

Sir Syed University of Engineering and Technology
First position in the final year project competition at Sir Syed University of Engineering and Technology.

GDSC & Microsoft Learn Student Ambassadors
Mentored 1000+ students in AI/ML via workshops and community events at Google Developer Student Clubs and MLSA. – 2020-2022




Microsoft
Issued Jan 2021
Credential ID wnYYD-48DY

Microsoft
Issued Dec 2020
Credential ID wnqmz-48Eq





IBM
Issued May 2020
Credential ID MBJSPDSFEMQA


IBM
Issued May 2020
Credential ID 4A2F9HRJJC9G