Explanations
mentions related to historical or current geopolitical events and military actions
Negative Logits
Dise      -0.75
phies     -0.68
aca       -0.67
atars     -0.67
aji       -0.66
"$:/      -0.66
sections  -0.65
ollo      -0.65
acebook   -0.64
athetic   -0.63
Positive Logits
countered  0.77
resorted   0.77
claim      0.71
reluct     0.68
retali     0.68
repe       0.67
penchant   0.66
practiced  0.66
pursu      0.65
threw      0.65
Activations Density: 9.701%