INDEX
Explanations
names of people and possibly places
proper nouns, particularly names of people and characters
New Auto-Interp
Negative Logits
Eliot
-0.62
âĹ¼
-0.60
KT
-0.58
Kirk
-0.57
gling
-0.56
anonymity
-0.56
Irving
-0.56
Shattered
-0.56
hed
-0.55
hawk
-0.55
POSITIVE LOGITS
rosso
0.87
iola
0.85
berto
0.85
ondo
0.84
aldi
0.84
udo
0.84
Aless
0.82
arez
0.80
orio
0.80
á
0.79
Activations Density 0.111%