Explanations
references to wars and conflicts
Negative Logits
omik            -0.17
ichert          -0.15
ереч            -0.15
achat           -0.15
čan             -0.14
pressured       -0.14
visa            -0.14
stripslashes    -0.14
warts           -0.14
utow            -0.14
Positive Logits
fought          0.31
waged           0.31
raging          0.24
rage            0.22
heating         0.22
wages           0.22
rag             0.21
conducted       0.21
pit             0.19
intensity       0.19
Activations Density 0.158%