INDEX
Explanations
mentions of stainless steel and its various applications
New Auto-Interp
Negative Logits
erv
-0.17
ej
-0.15
ftime
-0.15
ersh
-0.15
ansom
-0.14
sey
-0.14
kk
-0.14
tones
-0.14
ei
-0.14
wig
-0.14
POSITIVE LOGITS
steel
0.26
steel
0.22
Steele
0.22
Steel
0.21
Steel
0.18
(es
0.17
PURE
0.16
light
0.16
owe
0.15
ype
0.15
Activations Density 0.004%