About Me
About
I received my MSc in Artificial Intelligence at the University of Essex, supervised by Professor Luca Citi. Previously, I obtained my bachelor's degree from the University of Kurdistan. During my postgraduate studies, I was working on reasoning in LLMs for code generation; as a result, I developed a new approach for guiding the generation process. It consisted of a separate deep-think stage with self-reflection (somehow similar to OpenAI o1) and a direct integration of thoughts (hidden states) into the LLM's main generation's hidden states using model surgery! This framework called NIR is applicable to various LLMs without finetuning. As an extension, It has a thought memory manager which controls the memories and thoughts of the LLM (not included in the evaluations) which allows the integration of approaches like Neural Turing Machines or DNC and LLMs. I have also contributed to the open-source projects and have collaborated with research teams in the past.
Research Interests
My ultimate objective is to study the cognitive mechanisms underlying intelligence and develop embodied agents able to reason and interact with the real world.
A major challenge in current AI models is their struggle with consistent logical reasoning and decision-making, especially in scenarios that require multi-step inference or handling of previously unseen situations. Current models (ie. LLMs) can generate highly semantically coherent text, yet, often fail in robust reasoning abilities that are necessary for tasks such as complex problem-solving, planning, or adapting to novel environments.
Researchers have explored many approaches such as Prompt engineering methods (eg. CoT and self-reflection), which
can enhance reasoning abilities to an extent as a temporarily and short-term solution, however, they are limited
and are not architecturally native to the models.
Therefore, my research focuses on developing differentiable approaches for implementing iterative reasoning and
decision-making in uncertain environments in embodied agents.
This involves developing new architectures that can maintain coherent thought processes across multiple inference steps.
Currently, I am exploring diffusion models, score-based and flow-based models, differential geometry (particularly Riemannian geometry), metric learning, energy-based models, joint embedding predictive architectures operating in the latent spaces among others to design generalizable intelligent agents capable of reasoning (as an optimization problem), deliberate planning (system 2-like), and able to handle uncertainty (in the inference-time)
In a wider context, I am fascinated by the idea of developing a network of specialized networks such as memory and rapid learning, goal-driven planning, spatial reasoning, and error detection and conflict monitoring that operate collaboratively to consistently make good decisions within a complex environment. I am interested in transformers and attention mechanisms with a bias toward efficient computation in these settings: generative models, multimodal learning, and self-supervised learning.
Research Highlights
- Reasoning and Planning: Embodied agents able to reason, plan and adapt to new environments
- Representation learning: Efficiently learn the geometry of parameters and data distributions
- Handling OOD problems: Robust out-of-distribution generalization
- Embodied Intelligent Agents: Agents that comprise the above capabilities and able to learn while interacting with the world
News
- I defended my MSc at the University of Essex - CSEE! (Thesis) - Oct. 2024
Awards
- Fully funding scholarship for postgraduate studies in Artificial Intelligence, University of Essex
Education
University of Essex
MSc. Artificial Intelligence
2023 - 2024
Dissertation: Neural Integration of Iterative Reasoning (NIR) in LLMs for Code Generation (Website)
Other:
- An Automatic Differentiation and Backpropagation Framework from Scratch (with a Keras-like API) (Codes)
- DL Competition Winner 🥇: Inter-Departmental Neural Network Challenge: Rossmann Store Sales
- Top scores among cohort (100+ participants): 0.10886 (public) / 0.11384 (private)
- Efficient LLM-centric Indoor Navigation Glasses using Jetson Mini
- RL-based Agents for Resistance (social game), Pacman, Chasing and escaping vehicles etc.
University of Kurdistan
BEng. Computer Engineering (Software)
2014 - 2018
Thesis: EfficientCoF - A New Efficient Subspace and K-Means Clustering-based Method to Improve Collaborative Filtering (Codes)
Other:
- Combined Adaptive Neuro-fuzzy Inference System and Particle Swarm Optimization (ANFIS-PSO) for predicting emissions from engine with different fuels (2016)
Experience
- Developed NIR Framework
- Researching Reasoning
- TorchEBM library- Under development (a high performance and scalable Python lib for energy-based models)
- TransformerX library
- Emgraph library
- Bigraph library
Teaching Assistant
Oct. 2018 - Jan. 2019
- Social Networks Analysis: Python, Gephi, and review sessions
- Developed EfficientCof
Intern - KurdCloud
Jan. 2017 - Mar. 2018
- Developed NFC-based Attendance Tracking Systems
Projects
Oct. 2024-Present
Energy-Based Modeling library for PyTorch. Provides tools for sampling, inference, and learning in complex distributions. Keywords – variational-inference, Reasoning, Sampling, langevin-dynamics, Contrastive Divergence, Energy-based-model, Diffusion-models probabilistic-ML, Score-matching, Generative-ai
2021-2022
A Python library for building transformer-based models. Keywords – Deep-learning, Machine-learning, Transformers, Attention mechanisms, Algorithms, python library
2021-2022
A Python toolkit for embedding knowledge graph, developing and evaluating models Keywords – Deep-learning, Machine-learning, Graph, Knowledge-graph-embedding, Algorithms
2019-2022
A python library for link prediction in bipartite-networks</b>
Keywords – link-prediction, Machine-learning, Graph, Graph-analysis
2020-2022
A lightening-fast audio search engine python library on top of the Telegram messenger platform</b>
Keywords – Audio-search-engine, Audio-indexing, Knowledge-graph