DAF (SIGMOD 2019)
Han, M., Kim, H., Gu, G., Park, K., & Han, W. S. Efficient Subgraph Matching: Harmonizing Dynamic Programming, Adaptive Matching Order, and Failing Set Together. Proceedings of the 2019 International Conference on Management of Data, 1429–1446.
Database | Domain | V | E | label | avg-deg | Download |
---|---|---|---|---|---|---|
Yeast | PPI | 3,112 | 12,519 | 71 | 8.04 | link |
Human | PPI | 4,674 | 86,282 | 44 | 36.91 | link |
HPRD | PPI | 9,460 | 37,081 | 307 | 7.83 | link |
Communication | 36,692 | 183,831 | 20 | 10.02 | link | |
DBLP | Collaboration | 317,080 | 1,049,866 | 20 | 6.62 | link |
YAGO | Knowledge Graph | 4,295,825 | 11,413,472 | 49,676 | 5.31 | link |
IDAR (VLDB 2021)
Kim, H., Min, S., Park, K., Lin, X., Hong, S. H., & Han, W. S. IDAR: Fast supergraph search using DAG integration. Proceedings of the VLDB Endowment, Volume 13, Issue 9. 1456–1468.
IDAR aims to solve the supergraph search
task, where datasets are composed of multiple graphs. We provide number of connected components (N) and maximum number of vertices (VMax).
Database | Domain | N | VMax | Download |
---|---|---|---|---|
AIDS | Molecule | 42,687 | 438 | link |
NCI | Molecule | 265,242 | 342 | link |
PubChem | Molecule | 499,963 | 801 | link |
SymBi (VLDB 2021)
Min, S., Park, S. G., Park, K., Giammarresi, D., Italiano, G. F., & Han, W. S. Symmetric Continuous Subgraph Matching with Bidirectional Dynamic Programming. Proceedings of the VLDB Endowment, Volume 14, Issue 8. 1298-1310.
Symbi aims to solve the continuous subgraph matching
task, where data is given as insertion/deletion operations on graph data. We provide the number of operation triplets. 90% of operations were considered as initial graphs, and remaining 10% acted as the update operation.
Database | Domain | Num-Triples | Download |
---|---|---|---|
LSBench | RDF | 23,317,563 | link(migrated) link(Archived) |
Netflow | Traffic Traces | 18,520,759 | link |
VEQ (SIGMOD 2021, VLDBJ 2022)
Kim, H., Choi, Y., Park, K., Lin, X., Hong, S. H., & Han, W. S. Versatile Equivalences: Speeding up Subgraph Query Processing and Subgraph Matching. Proceedings of the 2021 International Conference on Management of Data, 925–937.
Subgraph search
datasets contain multiple graphs, which therefore we provide average statistics.
Database | Domain | N | Vavg | Eavg | download |
---|---|---|---|---|---|
PBDS | DNA, RNA, Protein | 600 | 2,939 | 3,064 | link |
PCM | Protein | 200 | 477 | 4,340 | link |
PPI | PPI | 20 | 4,942 | 26,667 | link |
IMDB | Collaboration | 1,500 | 13 | 66 | link |
Social | 4,999 | 509 | 595 | link | |
COLLAB | Collaboration | 5,000 | 74 | 2,457 | link |
For subgraph matching
, please refer to the datasets used in DAF
CRaB (ICDE 2021), DCQ (ICDE 2022)
Gu, G., Nam, Y., Park, K., Galil, Z., Italiano, G. F., & Han, W. S. Scalable graph isomorphism: Combining pairwise color refinement and backtracking via compressed candidate space. 2021 IEEE 37th International Conference on Data Engineering, 1368–1379.
Gu, G., Nam, Y., Park, K., Galil, Z., Italiano, G. F., & Han, W. S. Efficient Graph Isomorphism Query Processing using Degree Sequences and Color-Label Distributions. 2022 IEEE 38th International Conference on Data Engineering, 872–884.
We used the largest connected component when there are more than one.
Database | Domain | V | E | avg-deg | Download |
---|---|---|---|---|---|
Hamster | Social | 1,788 | 12,476 | 13.96 | link |
GrQc | Collaboration | 4,158 | 13,422 | 6.46 | link |
HepTh | Collaboration | 8,638 | 24,806 | 5.74 | link |
Social | 4,039 | 88,234 | 43.69 | link | |
CondMat | Collaboration | 21,363 | 91,286 | 8.55 | link |
HepPh | Collaboration | 11,204 | 117,619 | 21.0 | link |
AstroPh | Collaboration | 17,903 | 196,972 | 22.0 | link |
Brightkite | Social | 56,739 | 212,945 | 7.51 | link |
Plus | Router | 283,872 | 428,384 | 3.02 | link |
Amazon | Co-Purchase | 334,863 | 925,872 | 5.53 | link |
Gowalla | Social | 196,591 | 950,327 | 9.67 | link |
Adaptec | Circuits | 870,532 | 1,874,507 | 4.31 | link |
RoadNet(CA) | Road | 1,957,027 | 2,760,388 | 2.82 | link |
Youtube | Social | 1,134,890 | 2,987,624 | 5.27 | link |
Bigblue | Circuits | 3,795,055 | 8,712,138 | 4.59 | link |
Skitter | Internet | 1,694,616 | 11,094,209 | 13.09 | link |
LiveJournal | Social | 3,997,962 | 34,681,189 | 17.35 | link |