Explanations
references to significant historical events and scandals
Negative Logits
обы          -0.15
_inline      -0.15
_Static      -0.14
oppress      -0.14
assail       -0.14
وار          -0.14
'gc          -0.14
ATTACK       -0.14
homicides    -0.14
xia          -0.14
Positive Logits
scandal          0.42
scandals         0.40
scand            0.32
revelations      0.25
gate             0.24
corruption       0.23
allegations      0.23
controversy      0.22
Gate             0.21
controversies    0.21
Activations Density 0.284%