INDEX
Explanations
phrases related to geographical and political entities
New Auto-Interp
Negative Logits
ynchronously
-0.18
ỡ
-0.15
what
-0.15
ÑĥÑĩаÑģ
-0.15
whom
-0.14
qos
-0.14
ợ
-0.14
fois
-0.14
anus
-0.14
pleted
-0.13
POSITIVE LOGITS
il
0.46
ils
0.29
'il
0.28
ils
0.28
’il
0.27
_il
0.26
la
0.24
.il
0.24
les
0.23
IL
0.23
Activations Density 0.018%