Explanations
references to the name "Bob."
Negative Logits
Dani      -0.51
Xtreme    -0.49
URIS      -0.49
Kien      -0.47
I         -0.47
Utili     -0.45
jni       -0.44
Dani      -0.44
<h2>      -0.43
Martina   -0.43
Positive Logits
Bob       1.05
Bob       1.03
bob       0.87
BOB       0.84
bob       0.84
BOB       0.83
Púb       0.73
bobs      0.73
Majefty   0.71
gebob     0.69
Activations Density: 0.004%