Explanations
references to specific events or actions taken by institutions or individuals
Negative Logits
Token       Logit
erm         -0.17
anga        -0.17
somebody    -0.15
Ë           -0.15
ume         -0.15
rumor       -0.15
clusions    -0.14
imi         -0.13
ÑĢеÑī       -0.13
loom        -0.13
Positive Logits
Token       Logit
remarks     0.30
comments    0.26
statements  0.25
statement   0.22
Remarks     0.21
Tuesday     0.20
remarks     0.20
Thursday    0.20
response    0.20
comments    0.20
Activations Density: 0.083%