References: Frequent Subgraph Mining¶
-
Subgraph Isomorphism - Wikipedia - Covers the NP-complete decision problem of determining whether one graph contains another as a subgraph, including complexity proofs, canonical algorithms (VF2, Ullmann), and connections to graph homomorphism.
-
Network Motif - Wikipedia - Explains recurring statistically over-represented subgraph patterns in biological and social networks, covering Z-score computation, null model construction, and landmark studies in transcriptional regulation networks.
-
Graphlet - Wikipedia - Describes small connected non-isomorphic induced subgraphs used as features for network comparison and node/edge roles, with coverage of graphlet degree vectors and the graphlet kernel for graph classification.
-
Mining of Massive Datasets - Leskovec, Rajaraman, Ullman - Cambridge University Press - Chapter 10 covers frequent itemset and subgraph mining at scale, providing the algorithmic foundations (Apriori, anti-monotonicity) that underpin systems like gSpan and SPMiner.
-
Graph Theory - Diestel - Springer - The standard rigorous reference for graph isomorphism, subgraph and minor relations, and the Ramsey-type combinatorics that characterize the hardness landscape of subgraph containment problems.
-
SPMiner: Neural Subgraph Matching (Ying et al., 2020) - arXiv - Introduces SPMiner, the order-embedding approach that maps graphs into a partially ordered embedding space so that subgraph containment becomes a max-margin geometric query, enabling sub-millisecond frequency estimation on large graphs.
-
Order Embeddings of Images and Language (Vendrov et al., 2016) - arXiv - Foundational paper on order-preserving embeddings for partial orders; SPMiner directly adapts this framework to encode the subgraph containment partial order between graph neighborhoods.
-
Efficient Graphlet Counting for Large Networks (Ahmed et al., 2015) - arXiv - Presents scalable algorithms for counting 4- and 5-node graphlets in networks with millions of edges, including combinatorial identities that reduce enumeration cost and a comparison to exact versus sampling-based approaches.
-
PyTorch Geometric — Subgraph Utilities - PyTorch Geometric Docs - Documents
torch_geometric.utils.subgraph,k_hop_subgraph, and related functions used to extract neighborhood subgraphs for training and evaluating order-embedding models like SPMiner. -
Papers With Code — Subgraph Isomorphism - Papers With Code - Aggregates benchmarks, leaderboards, and linked implementations for subgraph isomorphism and neural subgraph matching, including SPMiner results on the ENZYMES, COX2, and BZR datasets used in the literature.