INDEX
Explanations
references to governmental or institutional agencies
New Auto-Interp
Negative Logits
ÑģÑı
-0.18
uous
-0.18
uously
-0.17
ELY
-0.17
athers
-0.16
spe
-0.15
ances
-0.15
ستاÙĨ
-0.15
ously
-0.15
achines
-0.14
POSITIVE LOGITS
ing
0.27
provoc
0.23
wide
0.23
errupted
0.18
arity
0.18
Ost
0.17
nement
0.16
elling
0.16
wagon
0.16
页éĿ¢åŃĺæ¡£å¤ĩ份
0.16
Activations Density 0.064%