Soran Ghaderi

ML Research Engineer at CoSTAR National Lab | Incoming DPhil student in Computer Science

Soran Ghaderi

About

ML Research Engineer at CoSTAR National Lab working on video and world models for virtual production. Previously research intern at UIUC Blender Lab (energy-based models). MSc AI (Distinction) from the University of Essex, supervised by Prof. Luca Citi.

Research Interests

A representation of a system can be learned and refined from partial views and measurements. A signal, a geometric system evolving over time, or an image could be the different faces of the same underlying system or its evolved state. Most of what we ask of such a system, whether we are recovering something hidden, reconstructing what we cannot observe directly, or anticipating how it will behave, is the same question asked through a different measurement.

I'm interested in developing differentiable methods for learning such representations.

When a general method can absorb the structure of a problem from scale, it usually overtakes the carefully engineered alternative .

Taken to its limit this points to one model rather than many. A representation of biological systems wide enough to reach across scales, from molecular structure through anatomy and imaging to how a patient changes over time, learned together so that what is known at one scale sharpens inference at another.

▸ Key areas Click to expand
  • One representation, many questions: Treating tasks as conditionals of a single learned model of the system, where generalisation comes from a representation made rich by varied measurements and data modalities rather than from composition built in by hand.
  • Continuous-time generative models: Diffusion and score-based models, flow matching and stochastic interpolants, Schrödinger bridges, optimal transport, and energy-based models, used as learned priors and samplers. Built on ODEs and SDEs, Fokker-Planck and Itô calculus, and probability-flow ideas.
  • Amortised and simulation-based inference: Neural posterior estimation, amortised variational inference, and likelihood-free methods that turn expensive per-case fitting into a single forward pass.
  • Learned priors for ill-posed problems: Expressive generative models that make sense of measurements which do not determine a unique answer, with calibrated uncertainty and correct coverage.
  • Geometry and operators: Riemannian geometry and shape spaces for representation, and links to statistical mechanics.
  • Generalisation and robustness: Out-of-distribution behaviour, transfer across systems and regimes, and reliability when data are scarce or noisy.
  • Applications: Scientific inference, with digital twins and cardiac modelling as the present testbed.

Current Focus. Bringing diffusion, Schrödinger bridges, flow-based, and energy-based models under one view through statistical mechanics and differential geometry.

Writing

New Video World ModelsFrom slow bidirectional diffusion to real-time autoregressive world simulators. Causality, self-forcing, attention sinks, and the path to interactive video generation.

News

  • New Joined CoSTAR National Lab as ML Research Engineer - Feb. 2026
  • Research Collaboration with UIUC Blender Lab - Jul. 2025
  • MSc defended at Essex (NIR thesis) - Oct. 2024

Awards

  • Full scholarship for MSc AI, University of Essex

Education

University of Essex

MSc Artificial Intelligence (Distinction)

2023 - 2024

Dissertation: NIR — Neural Integration of Iterative Reasoning in LLMs for Code Generation

🥇 1st place, Inter-Departmental Neural Network Challenge (100+ participants)

University of Kurdistan

BEng Computer Engineering

2014 - 2018

Thesis: EfficientCoF — Subspace clustering for collaborative filtering

Experience

CoSTAR National Lab

ML Research Engineer

Feb 2026 - Present

Generative video and world models for virtual production. [Profile]

UIUC Blender Lab

Research Intern

Jul 2025 - Jan 2026

Energy-based transformers for image/video generation. Investigating mode collapse, inference-time compute scaling, and fast sampling strategies.

University of Essex

Postgraduate Researcher

Apr 2024 - Oct 2024

Developed NIR framework for integrating iterative reasoning into LLM hidden states without fine-tuning.

Open Source

Developer & Maintainer

2019 - Present

TorchEBM, cuRBLAS, TransformerX, Emgraph, Bigraph, TASE. See Projects section.

Projects

Updated Read more
PyTorch library for energy-based models, diffusion, and flow matching. Implements samplers, score/flow/contrastive objectives, SDE/ODE integrators, interpolants, and mixed-precision training.
PyTorch · CUDA
GitHub Repo stars PyPI - Downloads Downloads

Soon High-throughput world model inference engine. Stateful session management for BiD and AR world models (DreamDojo, Cosmos, DreamZero, etc.) with persistent KV-cache, CUDA graph capture, continuous batching, speculative decoding, and torch.compile fusion. Sub-millisecond scheduling via async token queues; REST/WebSocket streaming with backpressure.
PyTorch · CUDA · Python

GPU-accelerated randomized linear algebra. CUDA kernels for Hutchinson trace estimation, randomized SVD, and probabilistic matrix operations—useful for sliced score matching and large-scale ML.
C++ · CUDA · Python
GitHub Repo stars GitHub last commit

Modular transformer research library. Composable attention mechanisms, positional encodings, and architecture variants for rapid prototyping.
TensorFlow
GitHub Repo stars

Knowledge graph embedding library. Train and evaluate TransE, DistMult, ComplEx, and other relational models on link prediction tasks.
TensorFlow 2
GitHub Repo stars PyPI - Downloads

Bipartite graph link prediction. Implements Jaccard, Adamic-Adar, Common Neighbors, Preferential Attachment, and Katz similarity for two-mode networks.
NetworkX · NumPy
GitHub Repo stars PyPI - Downloads

Large-scale audio search engine. Scalable indexing and retrieval with Elasticsearch, ArangoDB, and Redis for real-time multimodal search.
Elasticsearch · Python
GitHub Repo stars