Zahra Gharaee

Zahra Gharaee

Ph.D. in Cognitive Science

Toronto, ON, Canada

About me

I am a machine learning professional with over seven years of experience leading cross-industry projects and working in cross-functional teams. I specialize in building, debugging, and deploying resilient machine learning and deep learning models, with a strong focus on scalability, usability, and practical impact. My work has led to the release of widely used datasets and open-source tools, and I have published at top-tier venues such as NeurIPS and ICML. I have designed robust data pipelines that streamline data ingestion and preprocessing, significantly improving data accessibility and efficiency. I thrive in collaborative environments, where I communicate technical ideas clearly to diverse stakeholders and align project goals across teams. I actively mentor junior researchers and contribute to the broader research community by reviewing for high-impact journals and conferences. Above all, I am passionate about continuous learning and enjoy exploring novel ideas that push the boundaries of what's possible with machine learning.

Skills

Experience

Research Associate

11/2024 - current

Vision and Image Processing Lab (VIP), Dept. of Systems Design Engineering

University of Waterloo, Waterloo, Canada

Projects: BIOSCAN-5MSuperFormer

Expertise: LLMs - Generative AI - Big Data Analytics - Biotechnology - Data Science

Research Fellow

02/2022 - 11/2024

Vision and Image Processing Lab (VIP), Systems Design Engineering

University of Waterloo, Waterloo, Canada

Projects: BIOSCAN-1MCausalityMoE-VRD

Expertise: Biotechnology - Data Science - Statistical Processing - Causal Inference - Video Analysis

Postdoc

09/2018 - 02/2022

Computer Vision Lab (CVL), Dept. of Electrical engineering

Linköping University, Linköping, Sweden

Projects: SLOPE-KPGAINBRL-VBVC

Expertise: Graph Neural Networks - Computer Vision - 3D Shape Reconstruction - Reinforcement Learning

Projects

BIOSCAN-5M

BIOSCAN-5M

About: Led the creation of a 5M-image multimodal dataset with standardized metadata, scalable preprocessing, and Hugging Face integration. Enabled robust ML experimentation, multimodal benchmarking, and large-scale taxonomic and statistical analysis.

Keywords: Biotechnology - Data science - Statistical processing, Supervised learning - Unsupervised learning - BERT - CLIP - Contrastive learning - Zero-shot clustering

View Project
SuperFormer

SuperFormer

About: A Transformer-based model that leverages superpixels for efficient Salient Object Detection (SOD).

Keywords: SWIN Transformer - Multimodal feature Representation - Efficient salient object detection - Fine-tuning - Pre-training - Fourier transformers

View Project
BIOSCAN-1M

BIOSCAN-1M

About: Developed a large-scale, taxonomy-aware insect image dataset with automated ML pipelines, robust model benchmarking (ResNet50, ViT), and reproducible evaluation across diverse experimental settings.

Keywords: Biotechnology - Data science - Vision transformers - Object detection - Big data analytics - Fine-tuning - Transfer Learning

View Project
Causality

Causality

About: Two approaches to generative and implicit causal representation learning.

Keywords: Causal inference - Representation learning - Generative models - Domain Adaptation - Domain generalization - Interventions - Variational autoencoders

View Project
MoE-VRD

MoE-VRD

About: Collaborated with Microsoft Media Group on modeling subject-object interactions in visual scenes. Developed and refined a framework for action recognition and scene understanding.

Keywords: Computer Vision - Scene graph generation - Video relationship detection - Mixture of experts

View Project
SLOPE-KP

SLOPE-KP

About: Co-led a project on 3D shape reconstruction and pose prediction from single images using self-supervised learning. Developed and evaluated a keypoint-based pipeline across static and dynamic datasets, achieving notable gains in accuracy and generalization.

Keywords: Computer vision - 3D shape reconstruction - Pose estimation - 3D Rotation

View Project
GAIN

GAIN

About: An approach to graph representation learning.

Keywords: Graph neural networks - Representation learning - Geospatial data analysis - Classification - Road network graphs - Urban infrastructure

View Project
BRL-VBVC

BRL-VBVC

About: An approach to Bayesian Reinforcement Learning of vision-based vehicular control.

Keywords: Autonomous systems - Reinforcement learning - Simulated environment - Semantic segmentation - Bayesian RL

View Project

Education

Ph.D. in Cognitive Science

03/2014 - 05/2018

Thesis: Action in mind: A neural network approach to action recognition and segmentation.

Lund University Cognitive Science (LUCS)

Lund University, Sweden

M.Sc. in Mechatronics

09/2009 - 02/2012

Thesis: Attention control learning in Decision Space Using State Estimation.

Advanced Process, Automation and Control Research Group (APAC)

K.N. Toosi University of Technology, Iran

B.Sc. in Electrical Engineering

09/2005 - 09/2009

Thesis: Design and implementation of a MIMO controller for a quadruple tank system.

Advanced Process, Automation and Control Research Group (APAC)

K.N. Toosi University of Technology, Iran

Certificates

Generative AI with Large Language Models

04/2025

Skills: LLMs, GenAI, AWS, Python

Issued by DeepLearning.AI and Amazon Web Services via Coursera.

  • Gain foundational knowledge, practical skills, and a functional understanding of how generative AI works
  • Dive into the latest research on Gen AI to understand how companies are creating value with cutting-edge technology
  • Instruction from expert AWS AI practitioners who actively build and deploy AI in business use-cases today

Public Speaking

Computational Entomology Webinar III: Processing liquid samples

06/2024

About: Computer vision and robotic tools to automatically process insect specimens stored in liquid samples.

Vancouver, Canada

GEO BON Global Conference: Monitoring Biodiversity for Action

10/2023

About: AI for Insect Monitoring.

Montreal, Canada

LiU Game Conference

11/2018

About: Computer games, visualization and digital experiences.

Linköping, Sweden

Selected Publications