INDEX
Explanations
words related to desires or wants
occurrences of the letter 'w'
New Auto-Interp
Negative Logits
bloom
-0.67
uate
-0.66
Duo
-0.65
tem
-0.64
ŃĶ
-0.63
inhibition
-0.63
semin
-0.63
plaintiff
-0.62
unpre
-0.61
confessions
-0.61
POSITIVE LOGITS
isdom
1.44
elcome
1.43
itness
1.40
orship
1.40
idespread
1.39
orry
1.36
ield
1.36
ielding
1.35
anted
1.33
restling
1.32
Activations Density 0.031%