INDEX
Explanations
references to shirts
Negative Logits
ronpa       -0.53
allan       -0.52
Max         -0.51
Grand       -0.50
numerus     -0.49
Moreau      -0.49
OGA         -0.48
GV          -0.48
BV          -0.48
Gregorio    -0.48
Positive Logits
Shirt       1.21
Shirt       1.16
shirt       1.15
Shirts      1.04
shirt       1.02
shirts      1.00
Shirts      0.98
shirts      0.89
hirt        0.85
Monfieur    0.85
Activations Density: 0.004%