INDEX
Explanations
words related to social gatherings and activities, particularly those involving food and drink like bars, picnics, and BBQs
references to bars or bar environments
New Auto-Interp
Negative Logits
Liang
-0.70
IDS
-0.69
Voy
-0.67
Virus
-0.67
OTA
-0.65
ils
-0.64
ENG
-0.64
iev
-0.63
ctive
-0.63
Exper
-0.62
POSITIVE LOGITS
bar
3.70
bars
2.79
bar
2.41
Bar
2.30
bars
2.13
Bar
2.05
Bars
2.03
BAR
1.73
bart
1.60
bartender
1.53
Activations Density 0.015%