INDEX
Explanations
various forms of conjunctions and markers indicating continuity in text
New Auto-Interp
Negative Logits
avax
-0.17
oling
-0.16
Ñıд
-0.15
zin
-0.15
Heb
-0.14
ibia
-0.14
Äįek
-0.14
ieg
-0.14
liament
-0.14
kova
-0.14
POSITIVE LOGITS
ural
0.14
unch
0.14
quo
0.14
Enrique
0.14
_fifo
0.14
upp
0.13
ÅŁt
0.13
hal
0.13
ulas
0.13
ures
0.13
Activations Density 0.002%