INDEX
Explanations
references to different types of alcoholic beverages, particularly wines
references to wine and wine-related terminology
Negative Logits
Token       Logit
aneously    -0.87
aneous      -0.85
urtle       -0.72
uled        -0.71
uli         -0.68
adden       -0.68
urat        -0.68
pta         -0.68
rencies     -0.68
ulation     -0.68
Positive Logits
Token       Logit
vinegar     1.12
tasting     1.12
grapes      1.05
wine        1.02
cellar      0.96
tast        0.93
wine        0.92
grape       0.87
vine        0.87
wines       0.86
Activations Density 0.025%
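
A density figure like the one above is usually read as the fraction of token positions on which the feature fires. The page does not state its definition, so the sketch below assumes that reading. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only for illustration; a density of 0.025% corresponds to roughly 1 active position in every 4,000.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds threshold.

    Assumption: "density" means the share of token positions with a
    nonzero (above-threshold) activation. The dashboard may define it
    differently.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1 active position out of 4000 tokens.
acts = [0.0] * 3999 + [1.7]
density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.025%
```

Under this assumption, a feature with such a low density is highly selective, which fits the narrow wine-related explanations listed above.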