INDEX
Explanations
occurrences of numerical values or quantities in the text
New Auto-Interp
Negative Logits
ongo
-0.16
ocop
-0.16
yne
-0.15
apiro
-0.14
Harm
-0.14
apt
-0.14
ing
-0.14
osu
-0.14
ych
-0.14
spe
-0.14
POSITIVE LOGITS
orden
0.17
qua
0.15
TOD
0.15
-sama
0.15
theless
0.15
.ci
0.14
latter
0.14
âĸ¡âĸ¡
0.14
urs
0.13
ullah
0.13
Activations Density 0.036%