Doctoral Dissertations

Off-campus UMass Amherst users: To download campus access dissertations, please use the following link to log into our proxy server with your UMass Amherst user name and password.

Non-UMass Amherst users: Please talk to your librarian about requesting this dissertation through interlibrary loan.

Dissertations that have an embargo placed on them will not be available to anyone until the embargo expires.

Foundations of Node Representation Learning

Sudhanshu Chanpuriya, University of Massachusetts AmherstFollow

Author ORCID Identifier

https://orcid.org/0000-0001-8505-4072

AccessType

Open Access Dissertation

Document Type

dissertation

Degree Name

Doctor of Philosophy (PhD)

Degree Program

Computer Science

Year Degree Awarded

2023

Month Degree Awarded

September

First Advisor

Cameron Musco

Subject Categories

Artificial Intelligence and Robotics | Data Science | Theory and Algorithms

Abstract

Low-dimensional node representations, also called node embeddings, are a cornerstone in the modeling and analysis of complex networks. In recent years, advances in deep learning have spurred development of novel neural network-inspired methods for learning node representations which have largely surpassed classical 'spectral' embeddings in performance. Yet little work asks the central questions of this thesis: Why do these novel deep methods outperform their classical predecessors, and what are their limitations? We pursue several paths to answering these questions. To further our understanding of deep embedding methods, we explore their relationship with spectral methods, which are better understood, and show that some popular deep methods are equivalent to spectral methods in a certain natural limit. We also introduce the problem of inverting node embeddings in order to probe what information they contain. Further, we propose a simple, non-deep method for node representation learning, and find it to often be competitive with modern deep graph networks in downstream performance. To better understand the limitations of node embeddings, we prove some upper and lower bounds on their capabilities. Most notably, we prove that node embeddings are capable of exact low-dimensional representation of networks with bounded max degree or arboricity, and we further show that a simple algorithm can find such exact embeddings for real-world networks. By contrast, we also prove inherent bounds on random graph models, including those derived from node embeddings, to capture key structural properties of networks without simply memorizing a given graph.

DOI

https://doi.org/10.7275/35892940

Recommended Citation

Chanpuriya, Sudhanshu, "Foundations of Node Representation Learning" (2023). Doctoral Dissertations. 2968.
https://doi.org/10.7275/35892940 https://scholarworks.umass.edu/dissertations_2/2968

Download

Included in

Artificial Intelligence and Robotics Commons, Data Science Commons, Theory and Algorithms Commons

COinS

ScholarWorks@UMass Amherst

Doctoral Dissertations

Foundations of Node Representation Learning

Author ORCID Identifier

AccessType

Document Type

Degree Name

Degree Program

Year Degree Awarded

Month Degree Awarded

First Advisor

Subject Categories

Abstract

DOI

Recommended Citation

Included in

Browse

Author Corner

Links

ScholarWorks@UMass Amherst

Doctoral Dissertations

Foundations of Node Representation Learning

Author

Author ORCID Identifier

AccessType

Document Type

Degree Name

Degree Program

Year Degree Awarded

Month Degree Awarded

First Advisor

Subject Categories

Abstract

DOI

Recommended Citation

Included in

Share

Browse

Author Corner

Links