INDEX
Explanations
mentions of the word "war."
instances of the word "War" and related numerical values or indicators
Negative Logits
essee    -0.92
ħĭ       -0.76
OGR      -0.67
drawer   -0.64
Ħ¢       -0.64
explan   -0.61
wired    -0.61
Seller   -0.61
livest   -0.59
topp     -0.59
Positive Logits
rior     1.50
riors    1.46
fare     1.27
locks    1.22
lords    1.20
ped      1.17
ping     1.16
lord     1.15
lock     1.07
bler     1.02
Activations Density: 0.038%