INDEX
Explanations
proper nouns or names
instances of the word "aff" in various contexts
New Auto-Interp
Negative Logits
CHR
-0.78
senal
-0.74
Penal
-0.70
Skydragon
-0.61
Santana
-0.60
ANGEL
-0.57
fear
-0.57
Schwarzenegger
-0.56
ATING
-0.56
OPLE
-0.55
POSITIVE LOGITS
ield
1.23
onso
1.09
irmation
1.07
yre
1.06
idav
1.06
ront
1.05
iculty
1.04
inity
1.01
ordable
1.01
licted
0.99
Activations Density 0.023%