INDEX
Explanations
information about military equipment losses and historical events
New Auto-Interp
Negative Logits
imaru
-0.70
agi
-0.67
iao
-0.67
hap
-0.65
arb
-0.65
laun
-0.64
hub
-0.64
agra
-0.64
hun
-0.63
hur
-0.62
POSITIVE LOGITS
(
1.26
(?,
1.12
(),
1.08
(_
1.05
(
1.04
().
1.04
()
1.02
(%
0.98
('0.97
(-
0.96
Activations Density 0.072%