INDEX
Explanations
instances of contrasting conjunctions indicating shifts in perspective or argumentation
New Auto-Interp
Negative Logits
dum
-0.17
.dw
-0.16
/component
-0.16
ties
-0.15
nia
-0.14
icha
-0.14
داد
-0.14
dao
-0.14
ctl
-0.14
meteor
-0.14
POSITIVE LOGITS
Bernardino
0.15
ire
0.15
ÃĹ↵↵
0.14
esz
0.14
etur
0.14
LIABLE
0.14
ovich
0.14
sem
0.14
oday
0.14
aca
0.14
Activations Density 0.086%