INDEX
Explanations
words related to people's names, particularly focusing on names that start with a capital letter and consist of two parts
proper nouns and character names in a movie or narrative context
New Auto-Interp
Negative Logits
ashtra
-0.75
FML
-0.74
estinal
-0.74
urches
-0.74
REL
-0.73
yrinth
-0.69
olesterol
-0.67
ICAN
-0.66
eele
-0.65
HDL
-0.65
POSITIVE LOGITS
Ott
0.79
Redditor
0.74
hardt
0.73
bye
0.69
theless
0.68
Wax
0.65
mot
0.64
woods
0.64
leaf
0.63
Toad
0.63
Activations Density 0.349%