Taxonomy Distribution Report¶
Overview¶
- Total Concepts: 300
- Number of Taxonomies: 15
- Average Concepts per Taxonomy: 20.0
Distribution Summary¶
| Category | TaxonomyID | Count | Percentage | Status |
|---|---|---|---|---|
| GNN | GNN | 44 | 14.7% | ✅ |
| Foundation Concepts - Prerequisites | FOUND | 37 | 12.3% | ✅ |
| ALGO | ALGO | 33 | 11.0% | ✅ |
| KG | KG | 25 | 8.3% | ✅ |
| APP | APP | 23 | 7.7% | ✅ |
| PREREQ | PREREQ | 20 | 6.7% | ✅ |
| TRAIN | TRAIN | 20 | 6.7% | ✅ |
| Advanced Topics | ADV | 18 | 6.0% | ✅ |
| EMB | EMB | 17 | 5.7% | ✅ |
| THEORY | THEORY | 17 | 5.7% | ✅ |
| TRANS | TRANS | 12 | 4.0% | ✅ |
| SCALE | SCALE | 10 | 3.3% | ✅ |
| TOOLS | TOOLS | 9 | 3.0% | ✅ |
| HETERO | HETERO | 8 | 2.7% | ℹ️ Under |
| GEN | GEN | 7 | 2.3% | ℹ️ Under |
Visual Distribution¶
GNN ███████ 44 ( 14.7%)
Foundation Concepts - Pre ██████ 37 ( 12.3%)
ALGO █████ 33 ( 11.0%)
KG ████ 25 ( 8.3%)
APP ███ 23 ( 7.7%)
PREREQ ███ 20 ( 6.7%)
TRAIN ███ 20 ( 6.7%)
Advanced Topics ███ 18 ( 6.0%)
EMB ██ 17 ( 5.7%)
THEORY ██ 17 ( 5.7%)
TRANS ██ 12 ( 4.0%)
SCALE █ 10 ( 3.3%)
TOOLS █ 9 ( 3.0%)
HETERO █ 8 ( 2.7%)
GEN █ 7 ( 2.3%)
Balance Analysis¶
✅ No Over-Represented Categories¶
All categories are under the 30% threshold. Good balance!
ℹ️ Under-Represented Categories (<3%)¶
- HETERO (HETERO): 8 concepts (2.7%)
- Note: Small categories are acceptable for specialized topics
- GEN (GEN): 7 concepts (2.3%)
- Note: Small categories are acceptable for specialized topics
Category Details¶
GNN (GNN)¶
Count: 44 concepts (14.7%)
Concepts:
-
- Graph Neural Network (GNN)
-
- Message Passing Neural Net
-
- Message Function
-
- Aggregation Function
-
- Update Function
-
- Graph Convolutional Network
-
- GraphSAGE
-
- Graph Attention Network (GAT)
-
- Attention Mechanism (Graph)
-
- Multi-Head Attention (Graph)
-
- Graph Isomorphism Network
-
- Sum Aggregation
-
- Mean Aggregation
-
- Max Aggregation
-
- Neighborhood Aggregation
- ...and 29 more
Foundation Concepts - Prerequisites (FOUND)¶
Count: 37 concepts (12.3%)
Concepts:
-
- Graph (Undirected)
-
- Graph (Directed)
-
- Node (Vertex)
-
- Edge (Link)
-
- Adjacency Matrix
-
- Node Degree
-
- In-Degree
-
- Out-Degree
-
- Degree Distribution
-
- Power-Law Degree Distribution
-
- Graph Path
-
- Shortest Path
-
- Graph Diameter
-
- Connected Component
-
- Strongly Connected Component
- ...and 22 more
ALGO (ALGO)¶
Count: 33 concepts (11.0%)
Concepts:
-
- Local Clustering Coefficient
-
- Betweenness Centrality
-
- Closeness Centrality
-
- Eigenvector Centrality
-
- PageRank
-
- Personalized PageRank
-
- HITS Algorithm
-
- Random Walk
-
- Stationary Distribution
-
- Power Iteration
-
- Teleportation (PageRank)
-
- Community Detection
-
- Modularity (Network)
-
- Louvain Algorithm
-
- Girvan-Newman Algorithm
- ...and 18 more
KG (KG)¶
Count: 25 concepts (8.3%)
Concepts:
-
- Knowledge Graph
-
- KG Entity
-
- KG Relation
-
- KG Triple
-
- KG Completion
-
- Link Prediction (KG)
-
- TransE
-
- TransR
-
- DistMult
-
- ComplEx
-
- RotatE
-
- KG Embedding Geometry
-
- Bilinear KG Model
-
- Symmetry (Relation Pattern)
-
- Antisymmetry (Relation)
- ...and 10 more
APP (APP)¶
Count: 23 concepts (7.7%)
Concepts:
-
- Recommender System (Graph)
-
- Collaborative Filtering
-
- Matrix Factorization (Rec)
-
- Neural Collaborative Filter
-
- LightGCN
-
- PinSage
-
- Drug-Drug Interaction
-
- Protein-Protein Interaction
-
- Molecular Graph
-
- Drug Discovery with GNNs
-
- Social Network Analysis
-
- Fraud Detection (Graph)
-
- Traffic Forecasting (Graph)
-
- Scene Graph
-
- Relational Database as Graph
- ...and 8 more
PREREQ (PREREQ)¶
Count: 20 concepts (6.7%)
Concepts:
-
- Matrix Multiplication
-
- Matrix Transpose
-
- Eigenvalue Decomposition
-
- Eigenvector
-
- Symmetric Matrix
-
- Positive Semi-Definite Matrix
-
- Singular Value Decomp (SVD)
-
- Matrix Rank
-
- Dot Product
-
- Cosine Similarity
-
- Gradient Descent
-
- Backpropagation
-
- Chain Rule (Calculus)
-
- Automatic Differentiation
-
- PyTorch Tensor
- ...and 5 more
TRAIN (TRAIN)¶
Count: 20 concepts (6.7%)
Concepts:
-
- Cross-Entropy (Node Class)
-
- Binary Cross-Entropy (LP)
-
- Negative Sampling (LP)
-
- Data Augmentation (Graph)
-
- Dropout (GNN)
-
- Batch Normalization (GNN)
-
- Layer Normalization (GNN)
-
- DropEdge
-
- PairNorm
-
- Early Stopping (GNN)
-
- Train/Val/Test Split (Trans)
-
- Train/Val/Test Split (Ind)
-
- Curriculum Learning (Graph)
-
- Contrastive Loss
-
- Graph Augmentation (SSL)
- ...and 5 more
Advanced Topics (ADV)¶
Count: 18 concepts (6.0%)
Concepts:
-
- In-Context Learning (Graphs)
-
- Conformalized GNN
-
- Graph Foundation Model
-
- LLM + GNN Integration
-
- Text-Attributed Graph
-
- Graph Instruction Tuning
-
- One-For-All (OFA) Model
-
- Agent Memory as Graph
-
- Tool-Use Graph
-
- Relational Deep Learning
-
- RelBench
-
- RelGNN
-
- Self-Supervised Learning
-
- Deep Graph Infomax (DGI)
-
- Graph Contrastive Learning
- ...and 3 more
EMB (EMB)¶
Count: 17 concepts (5.7%)
Concepts:
-
- Node Embedding
-
- Embedding Space
-
- Encoder-Decoder Framework
-
- Shallow Embedding
-
- Matrix Factorization (Graph)
-
- DeepWalk
-
- node2vec
-
- Biased Random Walk
-
- BFS Strategy (node2vec)
-
- DFS Strategy (node2vec)
-
- Skip-Gram Model
-
- Negative Sampling
-
- LINE Embedding
-
- Transductive Learning
-
- Inductive Learning
- ...and 2 more
THEORY (THEORY)¶
Count: 17 concepts (5.7%)
Concepts:
-
- Weisfeiler-Lehman (WL) Test
-
- 1-WL Test
-
- k-WL Test
-
- GNN Expressiveness
-
- Graph Isomorphism Problem
-
- GNN Distinguishing Power
-
- Over-Smoothing
-
- Over-Squashing
-
- GNN Bottleneck
-
- Graph Equivariance
-
- Graph Invariance
-
- Permutation Invariance
-
- Permutation Equivariance
-
- Position-Aware GNN
-
- Identity-Aware GNN
- ...and 2 more
TRANS (TRANS)¶
Count: 12 concepts (4.0%)
Concepts:
-
- Graph Transformer
-
- Graphormer
-
- SAN (Spectral Attention Net)
-
- GPS (General Powerful GNN)
-
- Laplacian Positional Encoding
-
- Random Walk Struct Encoding
-
- Sign-Invariant Network
-
- Basis-Invariant Network
-
- Spatial Bias (Graph Attention)
-
- Centrality Encoding
-
- Edge Encoding (Transformer)
-
- Relative Positional Encoding
SCALE (SCALE)¶
Count: 10 concepts (3.3%)
Concepts:
-
- Mini-Batch Training (GNN)
-
- Neighbor Sampling
-
- GraphSAINT
-
- Cluster-GCN
-
- LADIES Sampler
-
- Layer-Wise Sampling
-
- Historical Embeddings (SIGN)
-
- Graph Partitioning
-
- Graph Sampling Strategy
-
- SIGN Architecture
TOOLS (TOOLS)¶
Count: 9 concepts (3.0%)
Concepts:
-
- PyTorch Geometric (PyG)
-
- NetworkX
-
- Open Graph Benchmark
-
- SNAP
-
- DeepSNAP
-
- DGL
-
- TUDataset
-
- PyKEEN
-
- RelBench (Benchmark)
HETERO (HETERO)¶
Count: 8 concepts (2.7%)
Concepts:
-
- Heterogeneous GNN
-
- R-GCN (Relational GCN)
-
- Heterogeneous Graph Trans
-
- HAN (Hetero Attention Net)
-
- Meta-Path
-
- Relation-Specific Weights
-
- Basis Decomposition (R-GCN)
-
- Type-Specific Projection
GEN (GEN)¶
Count: 7 concepts (2.3%)
Concepts:
-
- Graph Generative Model
-
- GraphRNN
-
- GCPN
-
- Molecule Generation
-
- Variational Autoencoder (VGAE)
-
- DiGress
-
- Graph Generation Metrics
Recommendations¶
- ✅ Excellent balance: Categories are evenly distributed (spread: 12.3%)
- ✅ MISC category minimal: Good categorization specificity
Educational Use Recommendations¶
- Use taxonomy categories for color-coding in graph visualizations
- Design curriculum modules based on taxonomy groupings
- Create filtered views for focused learning paths
- Use categories for assessment organization
- Enable navigation by topic area in interactive tools
Report generated by learning-graph-reports/taxonomy_distribution.py