PhD Fellowship(s) in AI Safety and Mechanistic Interpretability of Language Models

2 days ago


Odense, Odense Kommune, Denmark University of Southern Denmark Full time 48,000 - 64,000 per year

The Centre for Machine Learning within the Data Science and Statistics Section of the Department of Mathematics and Computer Science (IMADA) at the University of Southern Denmark invites applications for PhD research fellowship position(s) within the field of machine learning, AI Safety, and mechanistic interpretability, to be filled earliest by 1 January 2026 for a period of three years.

The application deadline is December 10, 2025, 23:55 CET.

The research will be conducted within the MIST project (Scalable Mechanistic Interpretability for Safe and Trustworthy LLM Agents), recently funded by the Novo Nordisk Foundation. The project aims to develop scalable methods for understanding the inner workings of large language models (LLMs) and LLM agents, with a focus on identifying causal mechanisms underlying tool use, reasoning, and multi-agent communication. The research will investigate cross-model and cross-lingual universality of these mechanisms and develop functionally grounded steering techniques and methods to issue safety certificates.

Research topics include (but are not limited to) interpretability and transparency, agentic and multi-agent safety, control and containment, safety evaluation, sustainability and resource impact. Selected candidates may also contribute to ongoing work in the Danish Foundation Models project, particularly on multilinguality, evaluation, and efficiency aspects of language models.

We are seeking candidates with:

  • Strong desire to make significant contributions to science and specific interest in AI Safety and mechanistic interpretability research
  • Creative and independent thinking to develop novel approaches to challenging research problems
  • Outstanding theoretical background in machine learning and deep learning, demonstrated by excellent course grades and, where applicable, by research experience or scientific publications
  • Strong programming skills in Python and deep learning frameworks such as PyTorch, demonstrated by coursework, projects, or contributions to public code repositories
  • Experience with or strong interest in large language models, transformers, and natural language processing
  • Interest in multilingual models is a plus
  • Excellent spoken and written communication skills in English
  • Background in interpretability methods, causal inference, or multi-agent systems is advantageous but not required

The successful candidate will have the unique opportunity to contribute to establishing a new research group on AI Safety at SDU and will participate in publishing high-quality research papers at top-tier machine learning and NLP venues such as NeurIPS, ICLR, ACL, and EMNLP. The candidate will also fulfill teaching assistantship duties.

We will consider candidates who have (or will obtain before the start date) a Master's degree in Computer Science, Data Science, Artificial Intelligence, Mathematics, Statistics, or related fields. Candidates should demonstrate that they have passed at least two Master's level courses that cover advanced machine learning topics with a grade that corresponds to the top 10% within the native grade scale or can show comparable achievements such as a Master's project focusing on topics relevant to the PhD position.

IMADA has the unique feature of bringing mathematicians and computer scientists together within a single department to foster theoretically well-backed high-quality data science research. IMADA is home to many ongoing externally funded research projects, as well as to a rich curriculum of data science and artificial intelligence courses. Data Science and Statistics Group is a synergy platform for the data science experts in IMADA.

Place of work

The Department of Mathematics and Computer Science is located at the main campus of the University of Southern Denmark, Odense, Denmark. The University of Southern Denmark was founded in 1966 and now has more than 27,000 students, almost 20% of whom are from abroad. It has more than 3,800 employees, and 115 different study programmes in the fields of the humanities, social sciences, natural sciences, health sciences, and engineering. Its main campus is located in Odense, the third largest city in Denmark.

Odense provides family-friendly living conditions with the perfect combination of a historic city centre with an urban feel and yet a close proximity to beaches and recreational areas. Its location on the beautiful island of Funen is ideal with easy access by train or highway to the bigger cities of Aarhus and Copenhagen. As the birthplace of Hans Christian Andersen, Denmark's famous fairytale author, the city is home to a vibrant and creative population that hosts numerous festivals and markets throughout the year.

Contact information

For further questions about the position please contact Assistant Professor Lukas Galke Poech at .

If you experience technical problems, please contact hcm-

[Udfyldes af NAT HR]

  • Conditions of employment
  • Application, salary etc.
  • About SDU


  • Odense M, Odense Kommune, Denmark University of Southern Denmark Full time 160,000 - 182,000 per year

    DescriptionThe Centre for Machine Learning within the Data Science and Statistics Section of the Department of Mathematics and Computer Science (IMADA) at the University of Southern Denmark invites applications for PhD research fellowship position(s) within the field of machine learning, AI Safety, and mechanistic interpretability, to be filled earliest...


  • Odense, Odense Kommune, Denmark SDU Career Site Full time 45,000 - 70,000 per year

    The Centre for Machine Learning within the Data Science and Statistics Section of the Department of Mathematics and Computer Science (IMADA) at the University of Southern Denmark invites applications for PhD research fellowship position(s) within the field of machine learning, AI Safety, and mechanistic interpretability, to be filled earliest by 1 January...


  • Odense, Odense Kommune, Denmark University of Southern Denmark Full time 40,000 - 80,000 per year

    The Centre for Machine Learning within the Data Science and Statistics Section of the Department of Mathematics and Computer Science (IMADA) at the University of Southern Denmark invites applications for postdoctoral research fellowship position(s) within the field of machine learning, natural language processing, and AI safety. The proposed starting date is...


  • Odense M, Odense Kommune, Denmark University of Southern Denmark Full time 1,000 - 40,000 per year

    DescriptionThe Centre for Machine Learning within the Data Science and Statistics Section of the Department of Mathematics and Computer Science (IMADA) at the University of Southern Denmark invites applications for postdoctoral research fellowship position(s) within the field of machine learning, natural language processing, and AI safety. The proposed...


  • Odense, Odense Kommune, Denmark University of Southern Denmark Full time 70,000 - 120,000 per year

    The Department of Mathematics and Computer Science at the University of Southern Denmark, Odense, invites applications for a postdoctoral research fellowship in models of quantum programming languages. The position has a duration of 3 years. Due to the highly interdisciplinary nature of the position, the hired candidate will be part of both the section of...


  • Odense, Odense Kommune, Denmark SDU Career Site Full time 70,000 - 120,000 per year

    The Department of Mathematics and Computer Science at the University of Southern Denmark, Odense, invites applications for a postdoctoral research fellowship in models of quantum programming languages. The position has a duration of 3 years. Due to the highly interdisciplinary nature of the position, the hired candidate will be part of both the section of...


  • Odense, Odense Kommune, Denmark University of Southern Denmark Full time 45,000 - 70,000 per year

    The Department of Mathematics and Computer Science (IMADA) at the University of Southern Denmark, Odense, invites applications for a PhD position in quantum programming languages. The position has a duration of 3 years. Due to the highly interdisciplinary nature of the position, the hired candidate will be part of both the section of Artificial Intelligence,...


  • Odense, Odense Kommune, Denmark University of Southern Denmark Full time 55,000 - 70,000 per year

    SDU Civil and Architectural Engineering (CAE) invites applications for a PhD position in Structural Engineering, with a focus on advancing probabilistic modelling techniques for assessing and improving the performance of existing structures. The position is to be filled by February 1, 2026, or as soon as possible thereafter.Job DescriptionThe successful...


  • Odense, Odense Kommune, Denmark SDU Career Site Full time 50,000 - 65,000 per year

    The Department of Mathematics and Computer Science (IMADA) at the University of Southern Denmark, Odense, invites applications for a PhD position in quantum programming languages. The position has a duration of 3 years. Due to the highly interdisciplinary nature of the position, the hired candidate will be part of both the section of Artificial...


  • Odense, Odense Kommune, Denmark University of Southern Denmark Full time 40,000 - 60,000 per year

    At the laboratory of Associate Professor Kumar Somyajit, we seek a highly motivated PhD student with strong background in cell biology and prime interest in cell cycle, genome maintenance, cancer mechanisms, and quantitative imaging. Experience with CRISPR-based gene editing, optical microscopy, and quantitative skills are required. Previous work in the...