INDEX
Explanations
actions related to violent events
violent actions and their consequences
New Auto-Interp
Negative Logits
ibrary
-0.73
Founders
-0.63
erala
-0.62
urers
-0.60
nowadays
-0.60
Historically
-0.59
terday
-0.58
Founding
-0.57
quartered
-0.57
historically
-0.56
POSITIVE LOGITS
livion
0.69
unsuspecting
0.68
))))
0.68
VERTISEMENT
0.66
EXT
0.64
zzle
0.64
blinding
0.63
)))
0.60
eming
0.60
ggle
0.59
Activations Density 1.056%