INDEX
Explanations
the word "stealth" or variations of it
terms related to stealth and health
New Auto-Interp
Negative Logits
Pwr
-0.77
ICAN
-0.74
ONT
-0.73
ENTS
-0.71
artney
-0.70
onz
-0.68
ktop
-0.63
Interstitial
-0.63
ocese
-0.62
¢
-0.61
POSITIVE LOGITS
ily
1.14
ively
1.00
ibility
0.97
iest
0.95
camouflage
0.94
bomber
0.91
iness
0.90
door
0.89
stealth
0.85
y
0.84
Activations Density 0.041%