INDEX
Explanations
names, particularly those associated with "Lind" and its variations
New Auto-Interp
Negative Logits
ivas
-0.17
ssize
-0.16
iliate
-0.16
epar
-0.16
ãĥ³ãĤº
-0.16
loat
-0.15
ileged
-0.15
аÑĢаÑĤ
-0.14
atham
-0.14
eel
-0.14
POSITIVE LOGITS
gren
0.37
quist
0.34
emann
0.32
strom
0.31
ber
0.30
holm
0.29
erman
0.27
qv
0.27
ahl
0.27
enberg
0.27
Activations Density 0.008%