INDEX
Explanations
text related to military operations or terms
references to specific geographic locations or entities related to conflicts and terrorism
Negative Logits
Tycoon     -0.71
kefeller   -0.70
blem       -0.68
Cly        -0.67
ulators    -0.67
avorite    -0.64
Liver      -0.64
eryl       -0.64
olars      -0.64
incinn     -0.63
Positive Logits
eki        0.75
Grab       0.67
ĪĴ         0.66
AZ         0.62
esi        0.62
sha        0.61
iste       0.60
azi        0.60
Lago       0.59
Az         0.59
Activations Density: 0.359%