Explanations
details related to military actions and leadership qualities
Negative Logits
Token     Logit
pent      -0.18
lobal     -0.15
Gentle    -0.14
icorn     -0.14
ILT       -0.14
fleets    -0.14
/Gate     -0.13
adies     -0.13
arbon     -0.13
usz       -0.13
Positive Logits
Token     Logit
enemy     0.21
crawled   0.20
machine   0.20
bay       0.19
baz       0.19
Machine   0.18
grenade   0.18
bay       0.18
wounded   0.18
crawl     0.18
Activations Density: 0.035%