PGR Seminar – Erdem Kus & Junyu Zhang

You are warmly invited to the next PGR Seminar.

Date & Time: Monday 20/10/2025 14:00-15:00

Location: JC 1.33A

  1. Speaker: Erdem Kus

Title: Frugal Algorithm Selection for Combinatorial Search

Abstract: Solvers for combinatorial search and optimisation problems often exhibit highly complementary performance: instances that are hard for one solver may be easy for another. The Algorithm Selection Problem (ASP) addresses this by predicting, for each problem instance, which solver will perform best. Machine learning models trained for this purpose, however, are typically expensive to construct, as they require exhaustive solver runs on all training instances to obtain ground-truth performance data.

In this work, we propose a frugal alternative that formulates algorithm selection as an active learning problem. Instead of uniformly evaluating all solver–instance pairs, our method intelligently selects the most informative ones, thereby drastically reducing the cost of data collection. We show that standard active learning techniques are inadequate for this setting, as they overlook the structure and cost characteristics unique to algorithm selection. To address this, we introduce novel, cost-aware active learning strategies that leverage auxiliary models to balance informativeness and evaluation cost.

Bio: Erdem is a PhD candidate whose research focuses on Artificial Intelligence (AI) and Constraint Programming (CP).

  1. Speaker: Junyu Zhang

Title: Remaking Characters in Heritage Contexts to Support Inclusive Learning

Abstract: Characters in immersive environments have the potential to enrich user experience, improving engagement with heritage and in so doing benefiting heritage organisations and their communities. Creating authentic digital scenes based upon survey, archaeological and historical data, co-creative design and community engagement enables communities and their visitors to understand the past better. The understanding of authenticity stimulates the potential of enriching cultural heritage with the details of lives past and also discusses how this research benefits the Sustainable Development Goals.

Bio: Minty is a PhD candidate exploring the authenticity of characters to support inclusive learning in heritage contexts. She is interested in how digital technologies can be used in the intersection of different disciplines to achieve SDGs in the field of cultural heritage, so as to enhance the promotion, representation, and well-being in digital humanities education and also affect resonated dialogue and thinking among diverse people and communities in facing the current challenges.

We hope you can join us!

Inaugural Lecture showcase Wednesday 15th October

The Inaugural Lecture showcase takes place at Buchanan Lecture Theatre on Wednesday 15th October at 4:00pm.

Professor Richard Connor
‘Finding New Dogs with Old Tricks’

Professor Juliana Bowles
‘The Temporality of Formal Reasoning: As it was, as it is and as it could be’

Professor Susmit Sarkar
‘Peering Inside the Box: Making Modern Computing Safer with Mathematical Specifications’

Please come along if you can and support the school and our colleagues.

PGR Seminar – Qurat ul ain Shaheen

You are warmly invited to the next PRG Seminar.

Date & Time: Monday 13/10/2025 14:00-14:40

Location: JC 1.33A

Speaker: Qurat ul ain Shaheen

Title: A Framework for Uncertainty Sampling in Active Learning

Abstract: Uncertainty sampling is an active learning paradigm where data instances representing maximum uncertainty for a machine learning model are selected for training. This talk will explore existing uncertainty modelling approaches for binary classification of categorical data.  It will introduce a conceptual framework to improve uncertainty modelling and present some preliminary results.

Bio: Qurat ul ain Shaheen is a final year PhD researcher. Her research focuses on modelling uncertainty in active learning.

We hope you can join us!

Young Software Engineer of the Year 2025 Awards

Huge congratulations to Verity Powel, a winner at last night’s Young Software Engineer of the Year Awards (https://www.scotlandis.com/blog/rugby-video-tech-scores-top-award-for-st-andrews-student/). Her final year project “Video Analytics For Rugby Skills Training” was nominated by the school (https://blogs.cs.st-andrews.ac.uk/csblog/2025/07/28/nomination-to-young-software-engineering-of-the-year-awards-2025/) in June. The awards were announced at the ScotSoft 2025 (https://www.scotlandis.com/scotsoft-2025/), Scotland’s leading tech conference at the Edinburgh International Conference Centre.

The Young Software Engineer of the Year accolades are awarded to the best undergraduate software projects from students studying computer science and software engineering in Scotland. Over the years, St Andrews has many finalists and prize winners.

PGR Seminar – David Morrison

You are warmly invited to the next PRG Seminar.

Date & Time: Monday 06/10/2025 14:00-14:40

Location: JC 1.33A

Speaker: David Morrison

Title: Synthetic Whole Slide Image Patch Embeddings for Multiple Instance Learning

Abstract: Obtaining high-quality data is a persistent challenge for the training of computational pathology models. As medical data, Whole-slide images (WSIs) are often held under restrictive terms by medical institutions and, as a result, are hard to access by researchers. Where data is available, the number of whole slide images can be limited and skewed towards common pathology types. In addition, there can be issues with labelling: slide-level labels may lack information about specific pathologies, for example, they may be limited to binary labels of normal or malignant, while annotations at the level of patches are rarely available.

Synthetic data generation is a possible solution to these problems by allowing researchers to produce data on demand that can be used in an unrestricted manner with high-quality labels. I have previously presented on the generation of synthetic patch data. In this talk, I will discuss an extension to this work in which this approach is combined with models trained to characterise the slide as a whole in order to provide a synthesis process for data for use with multiple instance learning techniques, commonly used in whole slide image classification.

We hope you can join us!

 

Seminar series on computing intelligence

There will be a series of talks at the Global Research Centre for Diverse Intelligences which might be interesting to staff in the School.

It will be a mix of discussions about how different fields (i.e., not just CS) think about intelligence and some talks about various sub-fields of AI presented by CS staff.

Talks by Ruth Hoffmann, Nguyen Dang, and Phong Le will be about foundational AI topics: https://diverseintelligences.st-andrews.ac.uk/events/

 

PGR Seminar – Sharon Pisani & Mirza Hossain

The next PGR seminar is taking place this Friday 3rd October 11:00-12:00 in JC 1.33A.

Below are the Titles and Abstracts for Sharon and Mirza’s talks – Please do come along if you are able.

Sharon Pisani

Title: Building Sustainable Heritage Virtual Museums for Communities using Sociodata

Abstract: Virtual museums are moving beyond simple digitisation of artefacts to become dynamic platforms for community engagement and sustainable development. This talk introduces the VERA Platform, which combines a flexible Virtual Museum Infrastructure with a new layer of sustainability-oriented contextual data called sociodata. Sociodata links heritage objects to their cultural landscapes, local communities, and relevant Sustainable Development Goals, enabling richer discovery, analysis, and reuse. In this talk, I will outline the platform’s architecture and metadata model. The talk will highlight technical challenges such as interoperability with European data spaces, and supporting interactive storytelling at scale—issues highly relevant to digital infrastructure and data-driven research in the heritage sector.

Bio: Sharon is a PhD researcher examining the role of emergent digital technologies in preserving and engaging with cultural heritage while supporting sustainable development. Her research focuses on digitising cultural landscapes—both natural and cultural heritage—to assess various impacts on heritage and community identities. She explores how digital tools, including 3D scanning, 3D modeling, and mixed reality, can aid in recreating and safeguarding heritage at risk.

Mirza Hossain

Title: Fishing for monosemantic neurons in histopathology foundation models

Abstract: This early-stage study introduces Histoscope, an interactive system for examining sparse autoencoders (SAEs) that are trained on top of the UNI pathology encoder. Vision transformers for histopathology often exhibit superposition, where single neurons respond to multiple distinct tissue patterns, making interpretation difficult. Histoscope provides quantitative metrics and visualisations to assess whether neurons are monosemantic—associated with a single concept—or polysemantic—associated with multiple concepts. The work highlights methods for analysing internal representations of histopathology foundation models and contributes to efforts toward more transparent AI in pathology.

Bio: Mirza Hossain is a second-year PhD candidate in Computer Science at the University of St Andrews. His research focuses on multimodal AI in medical imaging with an emphasis on mechanistic interpretability of large foundation models. He is supervised by Dr. David Harris-Birtill.

 

Curious Workshop Success

On Saturday 13th September, the Royal Society of Edinburgh hosted a workshop titled: “Your Data, Your Story” as part of their Curious festival of knowledge. The event was led by a group of experts in data visualisation and data ethics, including three of our own, Dr Areti Manataki, Dr Tristan Henderson and Tilcia Woodville-Price.

The interactive workshop was designed for the general public, in particular for non-experts that are curious about data visualisation and data ethics. Participants were prompted to reflect on their own examples of personal data and think about the ethics behind the collection of personal data. This was followed by a brief introduction into data visualisation and physical representations of data (also called data physicalisations). Lastly came a hands-on activity which featured the use of everyday objects, such as string, pasta and Lego, to create physical representations of our data.

It was fascinating to speak with participants and learn what aspects of their personal data were important to them. Participants physicalized data ranging from their social media usage to the quality of interactions with their children, to the data about emails received versus responded and everything in between.

What about you? What kind of data is meaningful to you? Have you thought about how visualising data may help your daily activities or give you an opportunity for reflection?

Workshop team:

Dr Areti Manataki, Lecturer, School of Computer Science, University of St Andrews

Dr Uta Hinrichs, Reader in Data Visualisation, University of Edinburgh

Dr Tristan Henderson, Senior Lecturer, School of Computer Science, University of St Andrews

Dushani Perera, Research Associate, University of Edinburgh

Tilcia Woodville-Price, Doctoral Candidate, University of St Andrews

School Seminar – Peter Macgregor “Fast Dynamic Algorithms for Modern Clustering”

You are warmly invited to the second School Seminar:

Speaker: Peter Macgregor

Title: Fast Dynamic Algorithms for Modern Clustering

Abstract: Spectral clustering and DBSCAN both have long histories as theoretically grounded, general-purpose clustering algorithms. However, they face practical challenges when scaling to large datasets which have limited their adoption in practice.

In recent work, we have developed several improvements to these algorithms which improve their running time and space complexity while preserving their performance guarantees and generalising them to dynamically changing datasets. We make use of several algorithmic techniques including sparsification, dimensionality reduction, and random sampling. In this talk, I will present the recent progress and make the case that it’s time to challenge k-means’ dominance as the ‘default’ clustering algorithm.

Date & Time: Thursday 16/10/2025 11am-12pm

Location: JC 1.33A

Please do come along and join us! 🙂