INDEX
Explanations
mentions of historical events or wars, specifically World War II
mentions of historical wars, particularly World Wars
Negative Logits
Token       Logit
obook       -0.77
aminer      -0.70
uckland     -0.68
OGR         -0.67
Choice      -0.64
*/(         -0.62
ITY         -0.60
Asset       -0.60
66666666    -0.59
à           -0.58
Positive Logits
Token       Logit
riors       0.94
lords       0.93
lord        0.92
rior        0.85
fare        0.79
II          0.76
Patton      0.74
bler        0.74
rell        0.71
III         0.71
Activations Density: 0.014%