INDEX
Explanations
references to the name "Bruce."
New Auto-Interp
Negative Logits
Starlight
-0.73
meub
-0.67
Nelly
-0.66
раздо
-0.66
coration
-0.66
Haddad
-0.64
Octavia
-0.64
SEÑ
-0.63
••••
-0.63
logement
-0.63
POSITIVE LOGITS
Bruce
1.74
Bruce
1.59
BRUCE
1.38
bruce
1.34
bruce
1.31
Springsteen
1.10
bru
1.02
thâu
0.89
Bru
0.86
bru
0.85
Activations Density 0.015%