INDEX
Explanations
names of individuals
proper nouns, specifically names of people
Negative Logits
Token        Logit
berra        -0.56
âĢº          -0.55
uminati      -0.54
Archdemon    -0.53
Interested   -0.52
Paran        -0.50
attm         -0.49
Flavoring    -0.49
Slayer       -0.49
taboola      -0.48
Positive Logits
Token        Logit
kson         0.63
espie        0.62
recalled     0.59
laughed      0.55
vetoed       0.54
enson        0.53
yden         0.51
wrote        0.51
conceded     0.51
detractors   0.51
Activations Density 0.417%
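
The density figure above is most naturally read as the share of sampled tokens on which the feature fires at all. The sketch below illustrates that reading; it is an assumption about how the number is derived, not a statement of how this dashboard computes it. The function name `activation_density`, the `threshold` parameter, and the sample values are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a feature counts as "active" on a token when its
    activation is strictly greater than the threshold (zero by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1,000 tokens, of which 4 activate the feature.
sample = [0.0] * 996 + [1.2, 0.7, 3.1, 0.4]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.417% would mean the feature is active on roughly 4 of every 1,000 sampled tokens, which fits a sparse feature that responds mainly to personal names.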