INDEX
Explanations
instances of strong contrasts or transitions in text
Negative Logits
Token       Value
chen        -0.18
ни          -0.16
therefore   -0.15
amo         -0.14
ining       -0.14
dd          -0.14
:,          -0.13
mage        -0.13
o           -0.13
chef        -0.13
Positive Logits
Token        Value
że           0.21
tery         0.16
leyen        0.14
Ñľ           0.14
-syntax      0.14
abol         0.14
arith        0.13
wenn         0.13
XMLElement   0.13
artz         0.13
Activations Density: 0.036%