INDEX
Explanations
quantitative comparisons and measurements related to weight, water, and population statistics
Negative Logits
Token      Logit
šak        -0.15
somehow    -0.15
whenever   -0.14
yz         -0.14
دÛĮگرÛĮ     -0.14
plib       -0.14
sak        -0.14
Ñıв        -0.14
éı¡        -0.14
979        -0.13
Positive Logits
Token      Logit
typical    0.56
average    0.46
Typical    0.44
typ        0.44
average    0.41
Average    0.35
Average    0.34
åħ¸        0.31
-average   0.31
Typ        0.31
Activations Density 0.308%
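
The density figure above can be read as the share of token positions on which the feature fires. The sketch below illustrates that reading only; it assumes "fires" means an activation strictly greater than zero, and the function name `activation_density` and the sample data are hypothetical, not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: a feature "fires" on a token when its activation is strictly
    greater than `threshold` (0.0 by default).
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 1000 token positions, 3 of them with nonzero activation.
sample = [0.0] * 997 + [1.2, 0.4, 2.5]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.308% would mean the feature is active on roughly 3 of every 1000 tokens in the sampled text.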