INDEX
Explanations
words related to conflicts, violence, and international affairs
New Auto-Interp
Negative Logits
Trog
-0.66
arde
-0.64
footing
-0.61
engagements
-0.59
ennes
-0.59
disapprove
-0.59
communications
-0.57
extrad
-0.57
engagement
-0.56
equip
-0.56
POSITIVE LOGITS
³³³³
1.23
³³³³³³³³
1.21
³³³
1.17
³³³³³³³³³³³³³³³³
1.16
Article
1.00
Advertisement
1.00
advertisement
0.95
³³
0.91
Anyway
0.89
Nevertheless
0.87
Activations Density 1.117%