INDEX
Explanations
words related to names, particularly last names
words related to individuals with the surname "ini."
New Auto-Interp
Negative Logits
¥µ
-0.86
spin
-0.75
é¾įå¥ij士
-0.74
ĻĤ
-0.71
saw
-0.70
lished
-0.69
friends
-0.68
names
-0.68
Ĥ¬
-0.67
deck
-0.67
POSITIVE LOGITS
zzle
1.01
ini
0.94
emi
0.91
zzo
0.91
Äĩ
0.88
zzi
0.87
opsis
0.87
otti
0.87
Rossi
0.86
ÃŁ
0.84
Activations Density 0.010%