INDEX
Explanations
references to wine and its cultural significance
New Auto-Interp
Negative Logits
seins
-0.08
fg
-0.07
bens
-0.06
abant
-0.06
icari
-0.06
swer
-0.06
astos
-0.06
kün
-0.06
tesy
-0.06
Store
-0.06
POSITIVE LOGITS
producers
0.07
wines
0.07
stylist
0.07
ries
0.07
produce
0.06
disg
0.06
producing
0.06
output
0.06
produce
0.06
iles
0.06
Activations Density 0.008%