INDEX
Explanations
locations or organizations mentioned in news articles
instances of commas and punctuation in lists
New Auto-Interp
Negative Logits
shades
-0.65
macros
-0.64
exams
-0.63
terms
-0.62
accounts
-0.62
pointers
-0.61
tones
-0.61
levels
-0.61
cues
-0.61
spores
-0.60
POSITIVE LOGITS
MEN
1.03
CITY
0.96
COURT
0.89
DISTRICT
0.88
JUL
0.87
INC
0.84
INC
0.82
FANTASY
0.82
CHRIST
0.82
MEN
0.81
Activations Density 0.066%