Explanations
references to the word "Beer."
references to beer
Negative Logits
uncture      -0.96
iculty       -0.75
itives       -0.73
resses       -0.73
ean          -0.72
ktop         -0.70
etheless     -0.69
urities      -0.69
SPONSORED    -0.69
eanor        -0.68
Positive Logits
Beer         1.11
Beer         0.93
bell         0.77
beer         0.73
Gim          0.73
Barrel       0.72
wine         0.72
haus         0.71
leigh        0.70
Lover        0.69
Activations Density: 0.009%