INDEX
Explanations
references to bars and related establishments
Negative Logits
Token           Logit
'")             -0.69
__))            -0.68
뀜              -0.67
Aktualisiert    -0.66
뀌              -0.66
')):            -0.62
})`             -0.62
]');            -0.61
--}}            -0.61
)]$             -0.61
Positive Logits
Token    Logit
BAR      1.21
bar      1.14
BAR      1.12
bar      1.12
Bar      1.07
Bar      1.06
ibar     1.02
bars     1.01
Bars     1.00
IBar     0.98
Activations Density: 0.154%