Explanations
references to characters or concepts related to martial arts and warfare
references to soldiers and characters associated with military themes
Negative Logits
Token      Logit
itching    -0.84
eking      -0.81
reserve    -0.72
oulos      -0.72
andre      -0.70
oard       -0.70
oing       -0.68
ighting    -0.66
mint       -0.66
itions     -0.65
Positive Logits
Token          Logit
Soldier        0.93
Spy            0.82
vana           0.82
Mode           0.79
Nation         0.77
Jet            0.77
Agent          0.76
Generations    0.73
Girl           0.72
Tracker        0.72
Activation Density: 0.032%