INDEX
Explanations
words related to bars or establishments where alcoholic beverages are typically served
New Auto-Interp
Negative Logits
lihood
-0.92
IBLE
-0.82
Instruments
-0.73
sie
-0.72
ãģį
-0.69
ibilities
-0.69
ACTED
-0.68
CHRIST
-0.65
ç«
-0.65
UE
-0.64
POSITIVE LOGITS
bara
1.24
celona
1.17
riers
1.14
itone
1.14
iatric
1.14
bers
1.13
bell
1.10
bed
1.09
becue
1.07
bing
1.03
Activations Density 2.126%