Hi, I'm Aayan 👋
Artifical Intelligence/Machine Learning Developer & Researcher
AM

About

Hey there! I'm Aayan and I work on building AI projects, from finetuning models like OdysseyXL to developing systems such as Maverick Search. Whether its deep learning, experimenting with model optimization, or creating something entirely new, I'm always exploring something. When I'm not coding or training models, you'll probably find me watching Formula 1.

Education/Certificates

H

Harvard University - CS50x: Introduction to Computer Science

2025 - Present
This is CS50x , Harvard University's introduction to the intellectual enterprises of computer science and the art of programming for majors and non-majors alike, with or without prior programming experience. An entry-level course taught by David J. Malan, CS50x teaches students how to think algorithmically and solve problems efficiently. Topics include abstraction, algorithms, data structures, encapsulation, resource management, security, software engineering, and web development. Languages include C, Python, SQL, and JavaScript plus CSS and HTML. Problem sets inspired by real-world domains of biology, cryptography, finance, forensics, and gaming.
H

Hugging Face - LLM Course

2025 - 2025
The Hugging Face LLM Course is a practical, hands-on introduction to working with large language models (LLMs) using the Hugging Face ecosystem. It covers key concepts like transformers, tokenization, model inference with pipelines, fine-tuning on custom datasets, evaluation, and deployment. Designed for developers and ML practitioners with basic Python skills, the course teaches how to leverage state-of-the-art models for tasks such as summarization, classification, and translation. Learners get to explore the Hub, train and share models, and optimize inference with tools like Text Generation Inference (TGI) and accelerate, all through interactive Colab notebooks and real-world examples.
T

TAFE NSW - Introduction to Artificial Intelligence

2025 - 2025
Completed a 2.5-hour self-paced online Microskill course introducing the fundamentals of Artificial Intelligence, with no prior technical knowledge required. Gained foundational understanding of how AI learns from data, explored real-world applications across various industries, learned key AI terminology, and received insights from industry professionals on starting a career in AI. The course also addressed common myths and misconceptions surrounding AI. Successfully completed all modules and assessments to earn a certificate of completion.
C

CodeSignal - Building Neural Networks with PyTorch

2024 - 2025
Master PyTorch with this learning path, designed for those experienced in Python and machine learning. From tensor basics to advanced modeling, it includes practical exercises focused on real-world datasets, such as the wine dataset, enhancing your deep learning skills through PyTorch.
S

Sololearn - Python Developer

2024 - 2025
Python is the world's fastest growing programming language is easy to read, learn and code. You'll learn to build interactive programs and automate your tasks, analyze and visualize even the most complex data and create AI and machine learning models. No previous coding experience needed.

Skills

Python
Vercel
NumPy
PyTorch
TensorFlow
scikit-learn
Docker
Keras
GCP
Azure
AWS
Pandas
Unsloth
Transformers
Diffusers
PEFT
Jupyter Notebooks
My Projects

Check out my latest work

I've worked on a variety of projects, from simple websites to complex web applications. Here are a few of my favorites.

OdysseyXL

OdysseyXL

Fine-tune of Stability.ai's SDXL text-to-image model for enhanced realism and better image generation

SDXL
Low-Code
Stability.ai
Cloud Training
Diffusers
Python
Athena

Athena

Athena is a high-performance LLM that is designed to excel in most STEM areas as well as general NLP tasks!

Python
Low-Code
text-to-text
NLP
Research
Maverick Search

Maverick Search

Maverick Search is an open-source AI search engine designed to run locally with Ollama. This project is designed to be an open-source alternative to major AI search engines such as Perplexity and etc.

Python
Low-Code
text-to-text
NLP
Research
Senna

Senna

Senna is a small but powerful open-source computer vision model based on YOLOv11 for detecting Formula 1 Cars

Python
Low-Code
Ultralytics
Computer Vision
Research
YOLO
Publications

I like building & researching things

  • V

    Vera-V1: Enhancing Multilingual Language Models with Group Relative Policy Optimisation (GRPO)

    This is a research project which I led the development of the Vera model family. The purpose of this research project was to analyse how we can improve non-reasoning multilingual LLMs through reinforcement specifically Group Relative Policy Optimisation (GRPO).
Contact

Get in Touch

Want to chat? Just shoot me a dm with a direct question on twitter and I'll respond whenever I can. I will ignore all soliciting.