INDEX
Explanations
references to political interventions and their consequences
negativity or harm
negatively affecting outcomes
New Auto-Interp
Negative Logits
suddenly
-0.53
suddenly
-0.53
estekak
-0.52
XmlAccessType
-0.51
soudain
-0.51
unexpected
-0.50
httphttps
-0.50
Unexpected
-0.49
plötzlich
-0.49
+#+#
-0.49
POSITIVE LOGITS
detrimental
1.02
damaging
0.99
harmful
0.97
destructive
0.95
doomed
0.92
disastrous
0.90
harm
0.89
damage
0.87
deleterious
0.86
detriment
0.85
Activations Density 0.506%