INDEX
Explanations
mentions of bars or bar-related activities
mentions of bars
Negative Logits
lihood       -0.97
UAL          -0.82
IBLE         -0.81
ãģį          -0.67
Languages    -0.65
Instruments  -0.65
Sounders     -0.64
åĬ           -0.63
wip          -0.63
Vehicles     -0.63
Positive Logits
itone        1.37
celona       1.21
bell         1.12
becue        1.07
bara         1.07
bers         1.07
iatric       1.05
raged        1.02
ista         1.02
keep         0.97
Activations Density: 0.031%