INDEX
Explanations
references to statistics and numerical data
Negative Logits
Token      Logit
Rough      -0.15
γά         -0.15
æĮ¥        -0.15
ÏĦον       -0.15
ammo       -0.15
pst        -0.14
itch       -0.14
ÙĦÛĮسÛĮ    -0.14
üven       -0.14
Fisher     -0.14
Positive Logits
Token      Logit
eldom      0.16
NECT       0.15
Strom      0.15
ÑĮ         0.15
erb        0.15
rosse      0.14
vore       0.14
ibi        0.14
ade        0.14
adol       0.13
Activations Density: 0.005%