INDEX
Explanations
personal statements or inner reflections describing oneself
phrases emphasizing exclusivity or being the only one
New Auto-Interp
Negative Logits
mares
-0.88
tones
-0.81
cies
-0.76
ieties
-0.73
ulations
-0.71
ships
-0.71
asonry
-0.71
ulas
-0.70
hips
-0.69
yles
-0.69
POSITIVE LOGITS
smartest
1.28
bearer
1.17
happiest
1.17
beneficiary
1.14
guy
1.11
rightful
1.08
embodiment
1.07
aggress
1.06
messenger
1.04
underdog
1.04
Activations Density 0.135%