INDEX
Explanations
phrases related to cooking and grilling techniques
Negative Logits
churn        -0.21
jog          -0.19
smugg        -0.17
launder      -0.17
harass       -0.17
whipped      -0.16
emaker       -0.16
weld         -0.16
quen         -0.15
rehabilit    -0.15
Positive Logits
shooting     0.27
banking      0.26
blogging     0.26
logging      0.25
gaming       0.25
printing     0.25
baking       0.24
trading      0.24
betting      0.24
hacking      0.24
Activations Density 0.999%
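
The density figure can be read as a simple ratio. The page does not define it, so the sketch below assumes that "Activations Density" means the percentage of sampled token positions on which the feature's activation is above zero. The function name, the threshold parameter, and the sample data are hypothetical and serve only to illustrate the arithmetic.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`.

    Assumption: density = (active positions / all positions) * 100.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: 1000 token positions, 10 of which activate the feature.
sample = [0.0] * 990 + [1.5] * 10
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.999% would mean the feature fires on roughly one token position in a hundred.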