INDEX
Explanations
repetitive conjunctions within the text
Negative Logits
ongs       -0.16
god        -0.15
anes       -0.14
akedown    -0.14
bane       -0.14
án         -0.14
ISIBLE     -0.14
/mol       -0.13
kup        -0.13
sg         -0.13
Positive Logits
etine      0.16
umont      0.15
zwar       0.14
itals      0.14
ATUS       0.13
isci       0.13
tep        0.13
opaque     0.13
арч        0.13
software   0.13
Activations Density 0.275%