Explanations
numerical data and percentages
Negative Logits
abant      -0.15
Verdana    -0.14
ondere     -0.13
ĥĿ         -0.13
Tanz       -0.13
itung      -0.13
nda        -0.13
buz        -0.13
otr        -0.13
au         -0.12
Positive Logits
ensen      0.16
ény        0.15
oint       0.14
nier       0.14
angelo     0.14
linger     0.14
constant   0.14
æģĴ        0.14
jer        0.13
amar       0.13
Activations Density 0.005%