My PhD thesis “Learning with Graphs using Kernels from Propagated Information” was supervised by Prof. Dr. Kristian Kersting. The members of my PhD committee were Prof. Dr. Stefan Wrobel, Prof. Dr. Christian Bauckhage, and Prof. Dr. Klaus Greve.
Traditional machine learning approaches are designed to learn from independent vector-valued data points. The assumption that instances are independent, however, does not always hold. On the contrary, there are numerous domains where data points are cross-linked, for example social networks, where persons are linked by friendship relations. These relations among data points make traditional machine learning difficult and often insufficient. Furthermore, data points themselves can have complex structure, for example molecules or proteins constructed from various bindings of different atoms. Networked and structured data are naturally represented by graphs, and for learning we aim to exploit their structure to improve upon non-graph-based methods.
However, graphs encountered in real-world applications often come with rich additional information. This implies many challenges for representation and learning: node information is likely to be incomplete, leading to partially labeled graphs; information may be aggregated from multiple sources and is therefore uncertain; or additional information on nodes and edges may be derived from complex sensor measurements and is thus naturally continuous. Although learning with graphs is an active research area, the two main lines of work each leave a gap. Learning with structured data, which explicitly models structural similarities between graphs, mostly assumes fully labeled graphs of moderate size with discrete and certain node and edge information. Learning with networked data, which naturally deals with missing information and huge graphs, mostly assumes homophily and disregards structural similarity. To close these gaps, we present a novel paradigm for learning with graphs that exploits the intermediate results of iterative information propagation schemes on graphs. Originally developed for within-network relational and semi-supervised learning, these propagation schemes have two desirable properties: they capture structural information, and they naturally adapt to the aforementioned issues of real-world graph data.
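To make the idea of such propagation schemes concrete, here is a minimal sketch of iterative label propagation on a small, partially labeled toy graph. The graph, labels, and iteration count are hypothetical illustrations, not taken from the thesis; the sketch only shows the general mechanism of spreading label mass along edges while clamping observed labels.

```python
import numpy as np

# Adjacency matrix of a small undirected toy graph: nodes 0-1 and 2-3
# form tightly connected pairs, joined by a bridge between nodes 1 and 2.
A = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]], dtype=float)

# Row-normalize the adjacency matrix into a random-walk transition matrix.
T = A / A.sum(axis=1, keepdims=True)

# Per-node label distributions over two classes: node 0 is observed as
# class 0, node 3 as class 1; nodes 1 and 2 are unlabeled (uniform).
labels = np.array([[1.0, 0.0],
                   [0.5, 0.5],
                   [0.5, 0.5],
                   [0.0, 1.0]])
labeled = np.array([True, False, False, True])

P = labels.copy()
for _ in range(20):
    P = T @ P                      # spread label mass along the edges
    P[labeled] = labels[labeled]   # clamp the observed labels

print(P.argmax(axis=1))  # predicted class per node
```

After a few iterations the unlabeled bridge nodes adopt the class of their nearer labeled endpoint, illustrating how the intermediate distributions P encode both the observed labels and the graph structure.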
Additionally, information propagation can be realized efficiently by random walks, leading to fast, flexible, and scalable feature and kernel computations. Further, by considering intermediate random walk distributions, we can model structural similarity for learning with both structured and networked data. We develop several approaches based on this paradigm. In particular, we introduce propagation kernels for learning on the graph level, and coinciding walk kernels and Markov logic sets for learning on the node level. Finally, we present two application domains where kernels from propagated information successfully tackle real-world problems.
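The use of intermediate random walk distributions can be sketched as follows. This is a simplified, hypothetical illustration of the flavor of a propagation-style graph kernel, not the thesis's actual construction (real propagation kernels bin or hash the node distributions rather than summing them): each graph is summarized by aggregating its node-label distributions at every propagation step, and two graphs are compared via a dot product of these stacked per-step features. The function names and toy graphs are invented for illustration.

```python
import numpy as np

def propagation_features(A, node_labels, n_classes, t_max=3):
    """Stack per-step aggregated label distributions into one feature vector."""
    T = A / A.sum(axis=1, keepdims=True)              # random-walk transitions
    P = np.eye(n_classes)[node_labels].astype(float)  # one-hot start distributions
    feats = []
    for _ in range(t_max):
        feats.append(P.sum(axis=0))  # aggregate class mass at this step
        P = T @ P                    # one propagation step
    return np.concatenate(feats)

def linear_kernel(A1, l1, A2, l2, n_classes=2):
    """Graph similarity as a dot product of propagation feature vectors."""
    return float(propagation_features(A1, l1, n_classes)
                 @ propagation_features(A2, l2, n_classes))

# Two toy graphs with binary node labels: a triangle and a path.
A_tri = np.array([[0, 1, 1], [1, 0, 1], [1, 1, 0]], dtype=float)
A_path = np.array([[0, 1, 0], [1, 0, 1], [0, 1, 0]], dtype=float)
print(linear_kernel(A_tri, [0, 0, 1], A_path, [0, 0, 1]))
```

Because the features record every intermediate step rather than only the converged distribution, two graphs with identical labels but different structure (here, triangle versus path) yield different feature vectors, which is what allows such kernels to capture structural similarity.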