INDEX
Explanations
mentions of luxury car brands
mentions of the word "Lam" and its variants
Negative Logits
Subtle        -0.89
ãģį           -0.80
Occupations   -0.73
Marketable    -0.71
ģĸ            -0.69
Fiction       -0.69
IBLE          -0.68
Decay         -0.66
FORM          -0.63
dit           -0.62
Positive Logits
borgh         1.50
ont           1.02
amo           0.93
inated        0.92
oran          0.91
ouse          0.90
uci           0.89
Lam           0.88
mer           0.87
othe          0.87
Activations Density 0.006%