Explanations
mentions of significant actions or beliefs regarding war and economic implications
Negative Logits
minded      -0.15
ointments   -0.14
nth         -0.14
ãĥ¼ãĤ¸      -0.14
oint        -0.14
ÑĨеп        -0.14
leton       -0.14
oda         -0.13
_cf         -0.13
rch         -0.13
Positive Logits
ãĥ¼ãĥĦ      0.16
acula       0.15
oner        0.15
Busty       0.15
erten       0.15
vor         0.15
atern       0.14
Hollow      0.14
,eg         0.14
ourke       0.14
Activations Density 0.083%