Explanations
consecutive conjunctions in sentences
Negative Logits
allerdings    -0.22
både          -0.20
jedoch        -0.18
ocre          -0.18
však          -0.16
however       -0.16
сь            -0.15
richt         -0.15
jednak        -0.14
and           -0.14
Positive Logits
/or           1.02
/OR           0.61
/of           0.43
rogen         0.40
rew           0.39
rog           0.38
/o            0.38
наче          0.36
且            0.34
erson         0.33
Activations Density: 2.009%