INDEX
Explanations
mentions of locations or scenarios related to military or war
references to battlefields and related warfare concepts
Negative Logits
perm        -0.79
clips       -0.79
pert        -0.76
uana        -0.74
bery        -0.70
ITY         -0.69
credit      -0.68
Reviewer    -0.67
loo         -0.67
iris        -0.66
Positive Logits
battlefield  0.91
commanders   0.87
1942         0.86
commander    0.86
trenches     0.81
1944         0.79
combat       0.77
1941         0.74
discharge    0.74
hardened     0.74
Activations Density: 0.122%