INDEX
Explanations
proper nouns, specifically names of individuals
proper nouns, particularly names of individuals
New Auto-Interp
Negative Logits
illy
-0.81
agra
-0.78
apeshifter
-0.77
information
-0.71
abol
-0.70
avers
-0.70
file
-0.69
creen
-0.69
eeper
-0.69
elling
-0.68
POSITIVE LOGITS
sson
1.48
Wilhelm
1.20
Hitler
1.20
Schwarzenegger
1.16
Herz
1.12
Engels
1.11
von
1.08
Sch
1.07
Ludwig
1.07
Schwarz
1.06
Activations Density 0.148%