INDEX
Explanations
instances of the word "perfect" and its variations
New Auto-Interp
Negative Logits
AndPassword
-0.17
atee
-0.16
holder
-0.15
felt
-0.15
iaux
-0.14
EEP
-0.14
ette
-0.14
оÑĤв
-0.14
ãĥ©ãĥ³
-0.14
bury
-0.14
POSITIVE LOGITS
storm
0.24
ing
0.23
ively
0.22
amente
0.22
ible
0.21
ibil
0.21
timing
0.21
imperfect
0.20
perfectly
0.20
-fit
0.20
Activations Density 0.025%