INDEX
Negative Logits
orp
-0.72
istration
-0.72
uably
-0.72
arcity
-0.70
PHOTOS
-0.69
ila
-0.68
alias
-0.67
anova
-0.66
isoft
-0.65
igsaw
-0.64
POSITIVE LOGITS
voices
1.11
hear
0.89
aloud
0.86
Voices
0.85
footsteps
0.82
louder
0.81
cliffe
0.81
heard
0.80
hearing
0.80
confessions
0.80
Activations Density 0.033%