INDEX
Explanations
instances of punctuation marks followed by proper nouns
New Auto-Interp
Negative Logits
tremend
-0.83
ilater
-0.76
recip
-0.68
halla
-0.67
igue
-0.65
uper
-0.64
sburg
-0.64
shocks
-0.63
upstairs
-0.62
erness
-0.61
POSITIVE LOGITS
Ahead
0.88
Thousands
0.80
Actor
0.78
Yesterday
0.77
Dozens
0.74
Riding
0.74
Following
0.73
Former
0.73
Latest
0.72
Earlier
0.70
Activations Density 0.048%