INDEX
Explanations
people's names
proper nouns, specifically names of people
Negative Logits
Token         Logit
Ö             -0.61
Interested    -0.59
lihood        -0.58
6000          -0.55
uminati       -0.55
https         -0.54
ettle         -0.54
berra         -0.53
..."          -0.52
_-_           -0.51
Positive Logits
Token         Logit
heirs         0.62
added         0.61
added         0.61
prevailed     0.61
spokeswoman   0.60
spokesman     0.59
conceded      0.58
lamented      0.56
boasted       0.56
admitted      0.55
Activations Density: 0.910%