INDEX
Explanations
mentions of conflicts, wars, and crises
references to conflicts, particularly wars and crises
New Auto-Interp
Negative Logits
prints
-0.80
mson
-0.78
Examination
-0.73
membr
-0.70
asonic
-0.68
Laboratories
-0.68
redit
-0.68
copies
-0.67
Magikarp
-0.67
patents
-0.67
POSITIVE LOGITS
raging
1.12
ravaged
1.06
raged
0.94
waged
0.89
rav
0.88
unfolding
0.87
fighting
0.87
aftermath
0.84
fighting
0.82
refugee
0.82
Activations Density 0.100%