INDEX
Explanations
names of specific individuals
proper nouns, particularly names of individuals
New Auto-Interp
Negative Logits
imately
-0.83
ashtra
-0.74
vous
-0.65
ï¸ı
-0.64
yip
-0.63
broom
-0.63
BLIC
-0.62
minecraft
-0.61
Haram
-0.59
LEDs
-0.59
POSITIVE LOGITS
asley
0.67
Marsh
0.64
ukes
0.64
canon
0.63
Kelley
0.63
vine
0.60
mberg
0.59
Cullen
0.57
oyer
0.57
Johnston
0.57
Activations Density 0.088%