INDEX
Explanations
instances of specific pronouns and temporal markers
New Auto-Interp
Negative Logits
autorytatywna
-0.89
Autoritní
-0.88
CWE
-0.81
__':
-0.81
kaarangay
-0.81
Taktlose
-0.74
OGND
-0.72
propOrder
-0.72
resourceCulture
-0.71
+#+#
-0.70
POSITIVE LOGITS
also
0.85
0.71
0.69
likewise
0.68
'
0.65
.
0.64
همچنین
0.62
同じく
0.62
in
0.60
others
0.58
Activations Density 0.440%