Marina Romanchikova (NPL): How good are our data? Measuring the data quality at National Physical Laboratory (School Seminar)

Abstract:

From mapping the spread of disease to monitoring climate change, data holds the key to solving some of the world’s biggest challenges. Dependable decisions rely on understanding the provenance and reliability of data. Historically, only a small fraction of the generated data was shared and re-used, while the majority of data were used once and then erased or archived. At NPL Data Science we are defining best practice in measurement data reuse and traceability by developing metadata standards and data storage structures to locate and interpret datasets and make them available for sharing, publication and data mining.

The talk will shed light on the most pressing issues in scientific data management and illustrate them with examples from industrial and academic practice. It will present several NPL Data Science projects that focus on delivering confidence in data obtained from life science imaging, medicine, geosciences and fundamental physics.

Speaker Bio:

Dr Marina Romanchikova joined the NPL Data Science team in 2017 to work on data quality and metadata standards. She obtained an MSc in Medical Informatics at the University of Heidelberg, Germany, where she specialised in medical image processing and in management of hospital information systems. In 2010 she received a PhD on Monte Carlo dosimetry for targeted radionuclide therapy at the Institute of Cancer Research in Sutton, UK. Marina worked for six years as a radiotherapy research physicist at Cambridge University Hospitals, where she developed methods for curation and analysis of medical images.

Current interests

– Quantitative quality assessment of medical images and medical image segmentation
– Harmonisation of medical and healthcare data from heterogeneous sources
– Applications of machine learning in healthcare
– Automated data quality assurance

Event details

  • When: 12th March 2019 14:00 - 15:00
  • Where: Cole 1.33a
  • Series: School Seminar Series
  • Format: Seminar

Lauren Roberts & Peter Michalák (Newcastle): Automating the Placement of Time Series Models for IoT Healthcare Applications (School Seminar)

Abstract:

There has been a dramatic growth in the number and range of Internet of Things (IoT) sensors that generate healthcare data. These sensors stream high-dimensional time series data that must be analysed in order to provide the insights into medical conditions that can improve patient healthcare. This raises both statistical and computational challenges, including where to deploy the streaming data analytics, given that a typical healthcare IoT system will combine a highly diverse set of components with very varied computational characteristics, e.g. sensors, mobile phones and clouds. Different partitionings of the analytics across these components can dramatically affect key factors such as the battery life of the sensors, and the overall performance. In this work we describe a method for automatically partitioning stream processing across a set of components in order to optimise for a range of factors including sensor battery life and communications bandwidth. We illustrate this using our implementation of a statistical model predicting the glucose levels of type II diabetes patients in order to reduce the risk of hyperglycaemia.
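The placement trade-off described above can be illustrated with a deliberately simplified sketch. It is not the speakers' method: it assumes a linear pipeline, only two locations (sensor and cloud), and a made-up cost model in which the sensor pays an energy cost for each operator it runs plus a cost per byte it transmits. The function name `best_cut`, the operator tuples and all numbers are hypothetical.

```python
def best_cut(input_bytes, operators, tx_cost_per_byte):
    """Choose how many leading pipeline operators to run on the sensor.

    operators: list of (name, sensor_energy, output_bytes) in pipeline order.
    Returns (cut, energy): the first `cut` operators run on the sensor and
    the rest run in the cloud. All costs are hypothetical units.
    """
    best = None
    sensor_energy = 0.0
    out_bytes = input_bytes  # with cut = 0 the raw data is transmitted
    for cut in range(len(operators) + 1):
        if cut > 0:
            _, energy, out_bytes = operators[cut - 1]
            sensor_energy += energy
        # Total sensor cost: local computation plus sending the data at the cut.
        total = sensor_energy + out_bytes * tx_cost_per_byte
        if best is None or total < best[1]:
            best = (cut, total)
    return best


# Hypothetical pipeline: each stage shrinks the data but costs sensor energy.
pipeline = [("filter", 2.0, 500), ("window_mean", 1.0, 50), ("model", 20.0, 5)]
print(best_cut(1000, pipeline, tx_cost_per_byte=0.01))
```

A real system would have more than two locations and several competing objectives (battery life, bandwidth, latency), so an exhaustive scan like this one would normally be replaced by a proper optimiser.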

Speaker Bios:

Lauren and Peter are final year PhD students at the CDT in Cloud Computing for Big Data at Newcastle University. Peter has a background in Computer Engineering from the University of Žilina, Slovakia and a double-degree in Computer Software Engineering from JAMK University of Applied Sciences, Jyväskylä, Finland. His research interests are within distributed event processing, edge computing and Internet of Things with a special focus on energy and bandwidth constraints. Lauren has an MMath degree from Newcastle University and her research interests lie in statistical modelling of time series data.

Event details

  • When: 26th February 2019 14:00 - 15:00
  • Where: Cole 1.33a
  • Series: School Seminar Series
  • Format: Seminar

Quintin Cutts (Glasgow): Re-imagining software engineering education through the apprenticeship lens (School Seminar)

Abstract:

Apprenticeship degrees have sprung up so fast that there has been little time for us all to reflect on how this apparently new form of education, to universities at least, could significantly affect our educational offerings. The University of Glasgow has been undertaking some preparatory work for Skills Development Scotland prior to running its apprenticeship degree in software engineering, and this has afforded us some time to see what others nationally and internationally have been doing, and to consider relevant aspects of the literature, as well as consult with industry. One view that we are developing of these degrees is as a true evolution of typical, largely campus-based, software engineering degrees, towards a full-blown professional degree such as in medicine, where university and hospitals are in real partnership over the training of doctors. In this talk, I will outline our thinking and raise a number of issues for discussion. In suggesting a closer relationship with industry in a talk in St Andrews, I do not of course miss the irony that industry accreditation was never (I believe) something that St Andrews was particularly bothered about – thinking that my BSc (Hons) 1988 is not accredited!

Event details

  • When: 19th February 2019 14:00 - 15:00
  • Where: Cole 1.33a
  • Series: School Seminar Series
  • Format: Seminar

Lewis McMillan (St Andrews): Parallel Computer Simulations of Light-Tissue Interactions for Applications in Medicine, Cosmetics Industry and Biophotonics Research (School Seminar)

Abstract:

Tissue ablation is a widely used treatment in both the cosmetic and medical sectors, used to treat various diseases or to improve cosmetic outcomes. We present our tissue ablation model, which can predict the depth of ablation and the surrounding thermal damage caused by the laser during ablation.

“Non-diffracting” beams have a multitude of uses in physics, from optical manipulation to improved microscopy light sources. For the first time, we show that these beams can be modelled using the Monte Carlo radiation transport method, allowing better insight into how these beams propagate in a turbid medium.

Both of these projects use the Monte Carlo radiation transport method (MCRT) to simulate light transport. The MCRT method is a powerful numerical method that can solve light transport through heavily scattering and absorbing media, such as biological tissues. The method is extremely flexible and can model arbitrary geometries and light sources. MCRT can also model the various micro-physics of the simulated medium, such as polarisation, fluorescence, and Raman scattering. This talk will give an overview of our group’s work, with particular focus on simulating tissue ablation, and modelling “non-diffracting” beams.
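For readers unfamiliar with MCRT, the core idea can be sketched in a few lines. This is a textbook toy, not the speaker's code: it assumes a homogeneous one-dimensional slab, isotropic scattering, and photons launched straight into the surface. The function name `simulate_slab` and its parameters (absorption coefficient `mu_a`, scattering coefficient `mu_s`, slab `thickness`) are illustrative choices.

```python
import math
import random


def simulate_slab(n_photons, mu_a, mu_s, thickness, seed=0):
    """Estimate the fractions of photons reflected, absorbed and transmitted
    by a homogeneous slab, tracking only the depth z of each photon."""
    rng = random.Random(seed)
    mu_t = mu_a + mu_s          # total interaction coefficient
    albedo = mu_s / mu_t        # probability that an interaction is a scatter
    counts = {"reflected": 0, "absorbed": 0, "transmitted": 0}
    for _ in range(n_photons):
        z, cos_theta = 0.0, 1.0  # start at the surface, heading into the slab
        while True:
            # Sample the distance to the next interaction (exponential law).
            step = -math.log(1.0 - rng.random()) / mu_t
            z += step * cos_theta
            if z < 0.0:
                counts["reflected"] += 1
                break
            if z > thickness:
                counts["transmitted"] += 1
                break
            if rng.random() > albedo:
                counts["absorbed"] += 1
                break
            # Isotropic scattering: new direction cosine uniform in [-1, 1).
            cos_theta = 2.0 * rng.random() - 1.0
    return {name: count / n_photons for name, count in counts.items()}


print(simulate_slab(10_000, mu_a=0.5, mu_s=5.0, thickness=1.0))
```

With scattering switched off (`mu_s=0`), the transmitted fraction should approach the Beer-Lambert value exp(-mu_a * thickness), which gives a quick sanity check. Realistic tissue models add three-dimensional geometry, anisotropic phase functions and refractive-index boundaries.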

Speaker Bio:

Lewis McMillan is a final year physics PhD student at St Andrews University. His research interests are in using the Monte Carlo radiation transport method for various applications within medicine and biophotonics.

Event details

  • When: 23rd April 2019 14:00 - 15:00
  • Where: Cole 1.33a
  • Series: School Seminar Series
  • Format: Seminar

Ian Gent (St Andrews): The Winnability of Klondike and Many Other Single-Player Card Games (School Seminar)

This is joint work with Charlie Blake.

Abstract:

The most famous single-player card game is ‘Klondike’, but our ignorance of its winnability percentage has been called “one of the embarrassments of applied mathematics”. Klondike is just one of many single-player card games, generically called ‘solitaire’ or ‘patience’ games, for which players have long wanted to know how likely a particular game is to be winnable for a random deal. A number of different games have been studied empirically in the academic literature and by non-academic enthusiasts.

Here we show that a single general purpose Artificial Intelligence program, called “Solvitaire”, can be used to determine the winnability percentage of approximately 30 different single-player card games with a 95% confidence interval of ± 0.1% or better. For example, we report the winnability of Klondike to within 0.10% (in the ‘thoughtful’ variant where the player knows the location of all cards). This is a 30-fold reduction in confidence interval, and almost all our results are either entirely new or represent significant improvements on previous knowledge.
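To give a feel for what a 95% confidence interval of ± 0.1% demands, the sketch below uses standard binomial statistics. It is an assumption that such a textbook treatment applies here: the authors' actual procedure may differ, for instance in how deals that could not be resolved are handled. The function names and the example counts are hypothetical.

```python
import math


def wilson_interval(wins, trials, z=1.96):
    """Wilson score interval for a proportion (z = 1.96 gives about 95%)."""
    p_hat = wins / trials
    denom = 1.0 + z * z / trials
    centre = (p_hat + z * z / (2.0 * trials)) / denom
    half = z * math.sqrt(
        p_hat * (1.0 - p_hat) / trials + z * z / (4.0 * trials * trials)
    ) / denom
    return centre - half, centre + half


def deals_needed(half_width, p=0.5, z=1.96):
    """Approximate number of random deals for a given interval half-width,
    using the normal approximation n = z^2 * p * (1 - p) / half_width^2."""
    return math.ceil(z * z * p * (1.0 - p) / half_width ** 2)


# Hypothetical experiment: 500 winnable deals out of 1000 sampled.
print(wilson_interval(500, 1000))
# Worst case (p = 0.5) sample size for a half-width of 0.1 percentage points.
print(deals_needed(0.001))
```

Under these assumptions the required sample is on the order of a million deals per game, each of which must be solved or proved unwinnable, which suggests why a fast general-purpose solver matters.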

Speaker Bio:

Ian Gent is professor of Computer Science at the University of St Andrews. His mother taught him to play patience and herself showed endless patience when he “helped” her by taking complete control of the game. A program to play a patience game was one of the programs he wrote on his 1982 Sinclair Spectrum, now on the wall outside his office.

Event details

  • When: 5th February 2019 14:00 - 15:00
  • Where: Cole 1.33a
  • Series: School Seminar Series
  • Format: Seminar

Emanuele Trucco (Dundee): Retinal image analysis and beyond in Scotland: the VAMPIRE project (School Seminar)

Abstract:

This talk is an overview of the VAMPIRE (Vessel Assessment and Measurement Platform for Images of the REtina) project, an international and interdisciplinary research initiative created and led by the Universities of Dundee and Edinburgh in Scotland, UK, since the early 2000s. VAMPIRE research focuses on the eye as a source of biomarkers for systemic diseases (e.g. cardiovascular, diabetes, dementia) and cognitive decline, as well as on eye-specific diseases. VAMPIRE is highly interdisciplinary, bringing together medical image analysis, machine learning and data analysis, medical research, and data governance and management at scale. The talk introduces concisely the aims, structure and current results of VAMPIRE, the current vision for effective translation to society, and the several non-technical factors complementing technical research needed to achieve effective translation.

Speaker Bio:

Emanuele (Manuel) Trucco, MSc, PhD, FRSA, FIAPR, is the NRP Chair of Computational Vision in Computing, School of Science and Engineering, at the University of Dundee, and an Honorary Clinical Researcher of NHS Tayside. He has been active since 1984 in computer vision, and since 2002 in medical image analysis, publishing more than 270 refereed papers and 2 textbooks, and serving on the organizing or program committee of major international and UK conferences. Manuel is co-director of VAMPIRE (Vessel Assessment and Measurement Platform for Images of the Retina), an international research initiative led by the Universities of Dundee and Edinburgh (co-director Dr Tom MacGillivray), and part of the UK Biobank Eye and Vision Consortium. VAMPIRE develops software tools for efficient data and image analysis with a focus on multi-modal retinal images. VAMPIRE has been used in UK and international biomarker studies on cardiovascular risk, stroke, dementia, diabetes and complications, cognitive performance, neurodegenerative diseases, and genetics.

Event details

  • When: 29th January 2019 14:00 - 15:00
  • Where: Cole 1.33a
  • Series: School Seminar Series
  • Format: Seminar

Population and Behavioural Sciences Division workshop and seminar

Seminar Details
Using Intensive Longitudinal Methods to Study Fear of Breast Cancer Recurrence in Everyday Life
Jean-Philippe Laurenceau, Ph.D.

Venue: Seminar room 1
Date: 14 January 2019
Time: 3-4pm.

Intensive longitudinal methods (also called experience sampling, daily diary, or ecological momentary assessment methods) allow researchers to study people’s health-related behavior, thoughts, and emotions as experienced in their natural contexts. Such data can reveal life as it is actually lived and provide insights that are not possible using conventional experimental or survey research methods. Dr. Laurenceau will review several findings from a study consisting of twice daily assessments of fear of cancer recurrence over 21 days obtained from breast cancer patients and their spouses/partners at key points in the cancer survivorship trajectory. This type of intensive longitudinal research design allows estimation of effects reflecting within-person change (versus between-person differences) in health-related outcomes and supports inferences that have high ecological validity, clinical relevance, and patient focus.

Refreshments will be served from 2.45pm.

If you would like to attend the workshop, please let Karen Hunter know so that she can ensure there is enough space.

Workshop Details
Introduction to Analysing Intensive Longitudinal Data
Jean-Philippe Laurenceau, Ph.D.

Venue: Level 3 meeting room
Date: 14 January 2019
Time: 1-2pm

Intensive longitudinal methods (also called experience sampling, daily diary, or ecological momentary assessment methods) produce data that allow researchers to study people’s behavior, thoughts, and emotions as experienced in their natural contexts. The multilevel or mixed-effects model for longitudinal data is a flexible analytic tool that can take account of complexities stemming from the multiple levels of analysis and temporal dependencies in the data. The goal of this workshop is to provide an overview of a full-cycle treatment of two fundamental research questions that can be addressed using intensive longitudinal methods: (a) What is the time course of the outcome variable, and (b) what is the within-person causal process that underlies the time course? A full-cycle treatment will take workshop participants through five stages of answering each research question: (1) Design study & collect data, (2) Visualize, (3) Analyze, (4) Write up results, and (5) Power the next study. Material for this workshop will be drawn from introductory sections of the presenter’s 2013 Guilford Press book “Intensive Longitudinal Methods: An Introduction to Diary and Experience Sampling Research” (www.intensivelongitudinal.com). Using SPSS and Mplus statistical software, attendees can follow along with the examples in the workshop handout or conduct analyses on their own laptops.
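One common preparatory step for separating within-person change from between-person differences is person-mean centering. The sketch below is a generic illustration in Python rather than material from the workshop, which uses SPSS and Mplus; the function name `person_mean_center` and the toy diary records are hypothetical.

```python
from statistics import mean


def person_mean_center(records):
    """Split each observation into a between-person and a within-person part.

    records: list of (person_id, value) pairs, one per diary entry.
    Returns a list of (person_id, person_mean, deviation_from_person_mean).
    """
    by_person = {}
    for person_id, value in records:
        by_person.setdefault(person_id, []).append(value)
    person_means = {pid: mean(values) for pid, values in by_person.items()}
    return [
        (pid, person_means[pid], value - person_means[pid])
        for pid, value in records
    ]


# Hypothetical diary data: two entries for person "a", one for person "b".
diary = [("a", 1.0), ("a", 3.0), ("b", 10.0)]
for row in person_mean_center(diary):
    print(row)
```

In a multilevel model, the person mean would typically enter as a between-person predictor and the deviation as a within-person predictor.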

Dr. Jean-Philippe Laurenceau is Unidel A. Gilchrist Sparks III Chair in the Social Sciences and Professor of Psychological & Brain Sciences at the University of Delaware. He is also Senior Research Scientist at CCHS’s Helen F. Graham Cancer Centre and Research Institute. He completed his B.A. cum laude at Cornell University and received his master’s and doctorate degrees from The Pennsylvania State University. Recently, Dr. Laurenceau has been studying how patients and spouses/partners cope with and maintain connection amid health-related adversity, including breast cancer and diabetes. He was an appointed member of a social and behavioural sciences grant review panel of the National Institutes of Health and has been PI or co-I on several research projects funded by the National Institute of Mental Health, National Institute of Child Health and Human Development, and the National Cancer Institute. He regularly teaches methodological workshops at the University of Michigan’s Summer Program in Quantitative Methods of Social Research and at Penn State’s Summer Institute for Longitudinal Methods. Dr. Laurenceau is co-author of the book “Intensive Longitudinal Methods: An Introduction to Diary and Experience Sampling Research” (2013, Guilford Press).

Event details

  • When: 14th January 2019
  • Where: N Haugh, St Andrews
  • Format: Seminar

School Seminar: Jason Alexander (Lancaster University) – What would you do if you could touch your data?

Title:  What would you do if you could touch your data?

Abstract: Data Physicalizations are physical artefacts whose geometry or material properties encode data. They bring digital datasets previously locked behind 2D computer screens out into the physical world, enabling exploration, manipulation, and understanding using our rich tactile senses. My work explores the design and construction of dynamic data physicalizations, where users can interact with physical datasets that dynamically update. I will describe our data physicalization vision and show our progress on designing, building, and evaluating physicalizations and discuss the many exciting challenges faced by this emerging field.

Speaker biography:  Jason is a Senior Lecturer in the School of Computing and Communications at Lancaster University. He has a BSc(Hons) and PhD in Computer Science from the University of Canterbury in New Zealand and was previously a post-doctoral researcher at the University of Bristol. His research is broadly in Human-Computer Interaction, with a particular interest in developing novel interactive systems to bridge the physical-digital divide. His recent work focuses on the development of shape-changing interfaces—surfaces that can dynamically change their geometry based on digital content—and their application to data physicalization. He also has interests in digital fabrication and novel haptic interaction techniques.

Event details

  • When: 29th November 2018 14:00 - 15:00
  • Where: Cole 1.33a
  • Format: Seminar

SRG Seminar: “Large-Scale Hierarchical k-means for Heterogeneous Many-Core Supercomputers” by Teng Yu

We present a novel design and implementation of the k-means clustering algorithm targeting supercomputers with heterogeneous many-core processors. This work introduces a multi-level parallel partition approach that not only partitions by dataflow and centroid, but also by dimension. Our multi-level ($nkd$) approach unlocks the potential of the hierarchical parallelism in the SW26010 heterogeneous many-core processor and the system architecture of the supercomputer.
Our design is able to process large-scale clustering problems with up to 196,608 dimensions and over 160,000 targeting centroids, while maintaining high performance and high scalability, significantly improving the capability of k-means over previous approaches. The evaluation shows our implementation achieves performance of less than 18 seconds per iteration for a large-scale clustering case with 196,608 data dimensions and 2,000 centroids by applying 4,096 nodes (1,064,496 cores) in parallel, making k-means a more feasible solution for complex scenarios.
This work is to be presented at the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC18).
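Partitioning by dimension is possible because squared Euclidean distance is a sum over dimensions, so each block of dimensions can contribute a partial distance that is reduced afterwards. The sequential sketch below only illustrates that decomposition; it is not the authors' implementation and says nothing about the SW26010 hardware. The names `partial_sq_dists`, `assign_point` and `n_blocks` are hypothetical.

```python
def partial_sq_dists(point, centroids, dims):
    """Squared-distance contribution of one block of dimensions,
    computed against every centroid."""
    return [sum((point[d] - c[d]) ** 2 for d in dims) for c in centroids]


def assign_point(point, centroids, n_blocks):
    """Return the index of the nearest centroid, accumulating the distance
    from per-block partial sums (each block could run on a separate core)."""
    n_dims = len(point)
    blocks = [range(b, n_dims, n_blocks) for b in range(n_blocks)]
    totals = [0.0] * len(centroids)
    for dims in blocks:
        partials = partial_sq_dists(point, centroids, dims)
        for k, part in enumerate(partials):
            totals[k] += part  # reduction step across dimension blocks
    return min(range(len(centroids)), key=totals.__getitem__)


centroids = [[0, 0, 0, 0], [10, 10, 10, 10]]
print(assign_point([9, 8, 11, 10], centroids, n_blocks=2))
```

The assignment is the same for any number of blocks; only the way the work is divided changes.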

Event details

  • When: 1st November 2018 13:00 - 14:00
  • Where: Cole 1.33b
  • Series: Systems Seminars Series
  • Format: Seminar, Talk