INDEX
Explanations
phrases related to wars and battles
references to war-related titles and themes
New Auto-Interp
Negative Logits
ciating
-0.73
Helpful
-0.71
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.69
ħĭ
-0.68
*/(
-0.67
©¶æ¥µ
-0.67
TOP
-0.66
itte
-0.65
Publisher
-0.65
ograp
-0.64
POSITIVE LOGITS
enegger
1.00
schild
0.82
attrition
0.81
istan
0.79
Afghanistan
0.79
Armageddon
0.77
raged
0.74
adesh
0.72
torn
0.69
opium
0.68
Activations Density 0.207%