INDEX
Explanations
mentions of specific names or brands, particularly those associated with music or popular culture
starting with "Bru", "Ru", or "subrufescens"
Bru or ru followed by endings
New Auto-Interp
Negative Logits
reaſon
-0.60
NTB
-0.59
Sabina
-0.58
Segurança
-0.58
Achtung
-0.58
pleaſure
-0.57
Epilepsy
-0.57
phant
-0.55
Cina
-0.55
Heiden
-0.54
POSITIVE LOGITS
ru
0.85
Bru
0.83
Bru
0.80
Ru
0.77
ru
0.69
bru
0.67
bru
0.61
Ru
0.61
Bruce
0.57
Brus
0.56
Activations Density 0.176%