SNAP/cit-HepTh
Arxiv High Energy Physics paper citation network
Name |
cit-HepTh |
Group |
SNAP |
Matrix ID |
2293 |
Num Rows
|
27,770 |
Num Cols
|
27,770 |
Nonzeros
|
352,807 |
Pattern Entries
|
352,807 |
Kind
|
Directed Graph |
Symmetric
|
No |
Date
|
2003 |
Author
|
J. Gehrke, P. Ginsparg, J. Kleinberg |
Editor
|
J. Leskovec |
Structural Rank |
|
Structural Rank Full |
|
Num Dmperm Blocks
|
|
Strongly Connect Components
|
20,086 |
Num Explicit Zeros
|
0 |
Pattern Symmetry
|
0.3% |
Numeric Symmetry
|
0.3% |
Cholesky Candidate
|
no |
Positive Definite
|
no |
Type
|
binary |
SVD Statistics |
Matrix Norm |
8.516069e+01 |
Minimum Singular Value |
6.734437e-66 |
Condition Number |
1.264555e+67
|
Rank |
21,616 |
sprank(A)-rank(A) |
|
Null Space Dimension |
6,154 |
Full Numerical Rank? |
no |
Download Singular Values |
MATLAB
|
Download |
MATLAB
Rutherford Boeing
Matrix Market
|
Notes |
Networks from SNAP (Stanford Network Analysis Platform) Network Data Sets,
Jure Leskovec http://snap.stanford.edu/data/index.html
email jure at cs.stanford.edu
High-energy physics theory citation network
Dataset information
Arxiv HEP-TH (high energy physics theory) citation graph is from the e-print
arXiv and covers all the citations within a dataset of 27,770 papers with
352,807 edges. If a paper i cites paper j, the graph contains a directed edge
from i to j. If a paper cites, or is cited by, a paper outside the dataset, the
graph does not contain any information about this.
The data covers papers in the period from January 1993 to April 2003 (124
months). It begins within a few months of the inception of the arXiv, and thus
represents essentially the complete history of its HEP-TH section.
The data was originally released as a part of 2003 KDD Cup.
Dataset statistics
Nodes 27770
Edges 352807
Nodes in largest WCC 27400 (0.987)
Edges in largest WCC 352542 (0.999)
Nodes in largest SCC 7464 (0.269)
Edges in largest SCC 116268 (0.330)
Average clustering coefficient 0.3295
Number of triangles 1478735
Fraction of closed triangles 0.1196
Diameter (longest shortest path) 14
90-percentile effective diameter 5.4
Source (citation)
J. Leskovec, J. Kleinberg and C. Faloutsos. Graphs over Time: Densification
Laws, Shrinking Diameters and Possible Explanations. ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining (KDD), 2005.
J. Gehrke, P. Ginsparg, J. M. Kleinberg. Overview of the 2003 KDD Cup. SIGKDD
Explorations 5(2): 149-151, 2003.
Files
File Description
cit-HepTh.txt.gz Paper citation network of Arxiv High Energy Physics Theory
category
cit-HepTh-dates.txt.gz Time of nodes (paper submission time to Arxiv)
cit-HepTh-abstracts.tar.gz Paper meta information (see below)
Example of paper meta information:
Paper: hep-th/0002031
From: Maulik K. Parikh
Date: Fri, 4 Feb 2000 17:04:51 GMT (10kb)
Title: Confinement and the AdS/CFT Correspondence
Authors: D. S. Berman and Maulik K. Parikh
Comments: 12 pages, 1 figure, RevTeX
Report-no: SPIN-1999/25, UG-1999/42
Journal-ref: Phys.Lett. B483 (2000) 271-276
\
We study the thermodynamics of the confined and unconfined phases of
superconformal Yang-Mills in finite volume and at large N using the AdS/CFT
correspondence. We discuss the necessary conditions for a smooth phase
crossover and obtain an N-dependent curve for the phase boundary.
\
NOTE: an identical version this problem already appears in the UF Collection:
cit-HepTh Pajek/HEP-th-new is identical to the
SNAP/cit-HeptTh graph. Since it's small, I have
decided to include both in the collection, to
keep the SNAP/ collection complete.
n: 27,770 nz: 352,807
|