INDEX
Explanations
names related to people or places
occurrences of the name "Bar" followed by various identifiers or titles
New Auto-Interp
Negative Logits
lihood
-0.93
SOS
-0.74
ãģį
-0.70
UAL
-0.70
hower
-0.69
IBLE
-0.68
Dangerous
-0.67
STATES
-0.66
士
-0.66
CRIP
-0.65
POSITIVE LOGITS
becue
1.23
riers
1.16
celona
1.16
itone
1.16
bell
1.06
rington
1.04
rier
1.04
bara
1.00
iatric
0.98
keep
0.96
Activations Density 0.017%