Explanations
names starting with Fran, Franc, Frank
Negative Logits
cortina    -0.83
boîte      -0.79
atop       -0.75
where      -0.75
ухода      -0.73
ודי        -0.70
young      -0.69
Trung      -0.69
hinted     -0.69
pfle       -0.69
Positive Logits
fran       1.02
Fran       0.99
Fran       0.91
franc      0.88
kende      0.87
fran       0.84
Laughter   0.84
">-        0.83
Franken    0.82
imanapun   0.82
Activations Density: 0.012%