INDEX
Explanations
references to the luxury automotive brand "Lamborghini."
references to the name "Lam" in various contexts
Negative Logits
Token        Logit
ģĸ           -0.69
GROUND       -0.67
REDACTED     -0.65
ãģį          -0.65
steroid      -0.64
Subtle       -0.63
steroids     -0.63
gradient     -0.59
inhibition   -0.58
override     -0.57
Positive Logits
Token        Logit
borgh        1.68
inated       1.22
ont          1.09
pling        1.06
pton         1.05
oufl         1.05
onte         1.04
azing        1.03
pert         1.02
inished      1.00
Activations Density 0.024%