INDEX
Explanations
proper nouns or names mentioned in news articles
references to notable events or accomplishments
New Auto-Interp
Negative Logits
citiz
-0.93
tiss
-0.91
tram
-0.83
screwed
-0.82
proport
-0.82
casc
-0.82
stray
-0.80
rul
-0.80
crunch
-0.79
hydrogen
-0.78
POSITIVE LOGITS
His
2.26
Born
2.07
He
1.99
During
1.66
Prior
1.59
Known
1.58
Contents
1.54
Recently
1.53
Years
1.53
Since
1.52
Activations Density 0.355%