Soran Ghaderi

About Me

About

I received my MSc in Artificial Intelligence at the University of Essex, supervised by Professor Luca Citi. Previously, I obtained my bachelor's degree from the University of Kurdistan. During my postgraduate studies, I was working on reasoning in LLMs for code generation; as a result, I developed a new approach for guiding the generation process. It consisted of a separate deep-think stage with self-reflection and a direct integration of thoughts (hidden states) into the LLM's main generation's hidden states using model surgery! This framework called NIR is applicable to various LLMs without finetuning. As an extension, It has a thought memory manager which controls the memories and thoughts of the LLM (not included in the evaluations) which allows the integration of approaches like Neural Turing Machines or DNC and LLMs. I have also contributed to the open-source projects and have collaborated with research teams in the past.

Research Interests

My ultimate objective is to study the cognitive mechanisms underlying intelligence and develop embodied agents able to reason and interact with the real world.

A major challenge in current AI models is their struggle with generalization, consistent logical reasoning, and decision-making, especially in scenarios that require multi-step inference or handling previously unseen situations. Current models (ie. LLMs) can generate highly semantically coherent text, yet, often fail in robust reasoning abilities that are necessary for tasks such as complex problem-solving, planning, or adapting to novel environments.

Researchers have explored many approaches such as Prompt engineering methods (eg. CoT and self-reflection), which can enhance reasoning abilities to an extent as a temporary and short-term solution, however, they are limited and are not architecturally native to the models. Therefore, my research focuses on developing differentiable approaches for implementing iterative reasoning and decision-making in uncertain environments in embodied agents. This involves developing new architectures that can maintain coherent thought processes across multiple inference steps.

Currently, I am exploring diffusion models, score-based and flow-based models, differential geometry (particularly Riemannian geometry), metric learning, energy-based models, joint embedding predictive architectures operating in the latent spaces among others to design generalizable intelligent agents capable of reasoning (as an optimization problem), deliberate planning (system 2-like), and able to handle uncertainty (in the inference-time)

In a wider context, I am fascinated by the idea of developing a network of specialized networks such as memory and rapid learning, goal-driven planning, spatial reasoning, and error detection and conflict monitoring that operate collaboratively to consistently make good decisions within a complex environment. I am interested in transformers and attention mechanisms with a bias toward efficient computation in these settings: generative models, multimodal learning, and self-supervised learning.

Research Highlights

Reasoning and Planning: Embodied agents able to reason, plan and adapt to new environments
Representation learning: Efficiently learn the geometry of parameters and data distributions
Handling OOD problems: Robust out-of-distribution generalization
Embodied Intelligent Agents: Agents that comprise the above capabilities and able to learn while interacting with the world

News

I defended my MSc at the University of Essex - CSEE! (Thesis) - Oct. 2024

Awards

Fully funding scholarship for postgraduate studies in Artificial Intelligence, University of Essex

Education

University of Essex

MSc. Artificial Intelligence

2023 - 2024

Dissertation: Neural Integration of Iterative Reasoning (NIR) in LLMs for Code Generation (Website)

Other:

An Automatic Differentiation and Backpropagation Framework from Scratch (with a Keras-like API) (Codes)
DL Competition Winner 🥇: Inter-Departmental Neural Network Challenge: Rossmann Store Sales
- Top scores among cohort (100+ participants): 0.10886 (public) / 0.11384 (private)
Efficient LLM-centric Indoor Navigation Glasses using Jetson Mini
RL-based Agents for Resistance (social game), Pacman, Chasing and escaping vehicles etc.

University of Kurdistan

BEng. Computer Engineering (Software)

2014 - 2018

Thesis: EfficientCoF - A New Efficient Subspace and K-Means Clustering-based Method to Improve Collaborative Filtering (Codes)

Other:

Combined Adaptive Neuro-fuzzy Inference System and Particle Swarm Optimization (ANFIS-PSO) for predicting emissions from engine with different fuels (2016)

Experience

Postgraduate Student Researcher (Reasoning)

Apr 2024 - present

Developed NIR Framework
Researching Reasoning

Open-Source Developer

2019 - present

TorchEBM library- Under development (a high performance and scalable Python lib for energy-based models)
TransformerX library
Emgraph library
Bigraph library

Teaching Assistant

Oct. 2018 - Jan. 2019

Social Networks Analysis: Python, Gephi, and review sessions

Undergraduate Research Assistant

Sep. 2017 - Jun. 2018

Developed EfficientCof

Intern - KurdCloud

Jan. 2017 - Mar. 2018

Developed NFC-based Attendance Tracking Systems

Projects

cuRBLAS 🍒

https://github.com/soran-ghaderi/cuRBLAS

Jun. 2025-Present
CUDA-accelerated randomized BLAS (RBLAS) algorithms & routines—cuBLAS for modern randomized linear algebra.
Keywords – CUDA, cuBLAS, cuRBLAS, NVIDIA, HPC, GPU-programming, Probabilistic methods, Linear algebra, Randomized methods
GitHub last commit

TorchEBM 🍓

github.com/soran-ghaderi/torchebm

Oct. 2024-Present
Energy-Based Modeling library for PyTorch. Provides tools for sampling, inference, and learning in complex distributions.
Keywords – variational-inference, Reasoning, Sampling, langevin-dynamics, Contrastive Divergence, Energy-based-model, Diffusion-models probabilistic-ML, Score-matching, Generative-ai
PyPI - Downloads GitHub last commit

TransformerX

github.com/tensorops/TransformerX

2021-2022
A Python library for building transformer-based models.
Keywords – Deep-learning, Machine-learning, Transformers, Attention mechanisms, Algorithms, python library
PyPI - Python Version PyPI - Implementation GitHub last commit PyPI - License PyPI - Format

Emgraph

github.com/bi-graph/emgraph

2021-2022
A Python toolkit for embedding knowledge graph, developing and evaluating models
Keywords – Deep-learning, Machine-learning, Graph, Knowledge-graph-embedding, Algorithms
PyPI - Downloads latest release

Bigraph

github.com/bi-graph/bigraph

2019-2022
A python library for link prediction in bipartite-networks
Keywords – link-prediction, Machine-learning, Graph, Graph-analysis
PyPI - Downloads latest release

TASE (Telegram Audio Search Engine)

github.com/appheap/TASE

2020-2022
A lightening-fast audio search engine python library on top of the Telegram messenger platform</b>
Keywords – Audio-search-engine, Audio-indexing, Knowledge-graph
PyPI - Downloads

Looking for PhD & Research/Job Opportunities - MSc in AI at the University of Essex