INDEX
Explanations
news headlines including the abbreviation "AP"
occurrences of citations or references in news articles
New Auto-Interp
Negative Logits
angan
-0.62
course
-0.62
antha
-0.61
ts
-0.58
enges
-0.57
scale
-0.57
Shape
-0.56
tons
-0.55
irection
-0.54
Scroll
-0.52
POSITIVE LOGITS
CITY
0.74
PROV
0.70
COUNTY
0.70
STATE
0.67
COL
0.67
+++
0.66
—
0.64
MARK
0.62
COR
0.62
COL
0.61
Activations Density 0.027%