Explanations
conjunctions and relational phrases connecting various subjects and aspects in the text
Negative Logits
aux             -0.20
_compat         -0.15
$?              -0.15
ognito          -0.14
èĩªåĬ¨çĶŁæĪIJ    -0.14
INCIDENTAL      -0.14
odos            -0.14
alker           -0.14
Aux             -0.14
æµ®             -0.14
Positive Logits
erst            0.21
former          0.18
ello            0.16
atio            0.16
946             0.16
azio            0.15
                0.15
218             0.15
~               0.15
ograf           0.14
Activations Density 0.034%