INDEX
Negative Logits
Mia
0.41
Intercept
0.41
UserId
0.41
Secret
0.40
mia
0.40
imi
0.38
secret
0.38
檤
0.38
mia
0.38
amı
0.37
POSITIVE LOGITS
Collins
0.75
Lincoln
0.69
Alton
0.69
Collins
0.68
Espan
0.62
Española
0.61
Lincoln
0.59
Lum
0.57
Alton
0.57
Deco
0.56
Activations Density 0.002%