Scalable Graph Distances

Graph representations of real-world phenomena are ubiquitous, ranging from social and information networks to technological, biological, chemical, and brain networks. Many graph mining tasks, including clustering, anomaly detection, nearest-neighbor search, similarity search, pattern recognition, and transfer learning, require a distance measure between graphs that can be computed efficiently. Existing distance measures between graphs fall short in several ways. They are overwhelmingly based on heuristics. Many do not scale to graphs with millions of nodes; others do not satisfy the metric properties of non-negativity, positive definiteness, symmetry, and the triangle inequality. This project studies a formal mathematical foundation for a family of graph distances that overcome these limitations, focusing on real-world applications in biology and social network analysis. It also provides a universal methodology for parallelizing the computation of graph distance metrics in this family over massive graphs with millions of nodes, and for scaling that computation over cloud computing resources.

Principal Investigators: Stratis Ioannidis (DNAL), Tina Eliassi-Rad (Network Science Institute/CCIS, Northeastern University), Jose Bento (Boston College).

Funding: National Science Foundation (IIS-1741197), Google Cloud Services.

Research Projects

Human-Robot Object Handover

Coordination of Dyadic Object Handover for Human-Robot Interactions is a project funded by NSF. In collaboration with Tunik and RIVER Labs at Northeastern, we are modeling natural human-to-human object handover dynamics in order to develop robotic behavior strategies for more human-like human-to-robot and robot-to-human object handover in human-robot teams of the future.

Estimating Protein Function From Structure

Mining for Mechanistic Information to Predict Protein Function is a project funded by NSF. In collaboration with researchers from the Chemistry Department, we are using machine learning techniques to develop computational models that can predict protein function from chemical and molecular structure. Models will also be explainable in the sense that active residues will be identified and their roles will be connected to predicted protein function.

Predicting Epileptogenesis After TBI

Multimodal Signal Analysis and Data Fusion for Post-traumatic Epilepsy Prediction is a project funded by NIH. In collaboration with researchers at USC Medical School, we are using machine learning techniques to discover features from multimodal data such as EEG, fMRI, DTI, and blood chemistry, in order to build models that can predict whether a traumatic brain injury (TBI) patient is susceptible to epileptogenesis, the emergence of epilepsy following TBI.

Published by admin on March 24, 2018
