INDEX
Explanations
capitalized words or acronyms preceded by a 'W'
occurrences of the letter 'W'
Negative Logits
Token            Logit
unpre            -0.80
gratification    -0.74
arial            -0.72
apprehension     -0.69
uate             -0.68
afore            -0.67
unarmed          -0.67
succeeding       -0.66
bottleneck       -0.63
Khe              -0.63
Positive Logits
Token            Logit
atts             1.31
OW               1.20
restling         1.19
atson            1.16
INGS             1.12
edge             1.12
reck             1.11
ITCH             1.10
ITNESS           1.09
ITH              1.08
Activations Density 0.037%