INDEX
Explanations
references to government entities or officials
Negative Logits
Token     Logit
odb       -0.15
pek       -0.15
gele      -0.14
ÙıÙĨ      -0.14
ì¹        -0.14
ucz       -0.14
.Java     -0.13
ees       -0.13
contexts  -0.13
istica    -0.13
Positive Logits
Token     Logit
t         0.30
ornment   0.26
’t        0.24
't        0.23
inda      0.22
ern       0.22
ind       0.21
inds      0.20
ender     0.20
orn       0.19
Activations Density: 0.005%