INDEX
Explanations
mentions of alcoholic beverages
mentions of alcoholic beverages, particularly liquor
New Auto-Interp
Negative Logits
pta
-0.67
ding
-0.66
knit
-0.64
humans
-0.62
ECD
-0.61
arrass
-0.61
Speech
-0.60
DIR
-0.59
rosis
-0.59
Aj
-0.59
POSITIVE LOGITS
essee
1.04
ice
0.96
ocaust
0.90
opoly
0.87
liquor
0.86
Stores
0.83
beverage
0.83
cohol
0.83
oca
0.81
bottles
0.80
Activations Density 0.037%