INDEX
Explanations
references to quotes or sources marked with 'W'
instances of the letter "W" in uppercase
Negative Logits
unpre          -0.75
gratification  -0.75
arial          -0.73
apprehension   -0.67
afore          -0.64
uate           -0.63
sucker         -0.62
locality       -0.62
İĭ             -0.60
tion           -0.60
Positive Logits
atts           1.30
restling       1.25
nesday         1.18
OW             1.15
orst           1.09
atson          1.09
anted          1.07
esley          1.06
atcher         1.06
izard          1.05
Activations Density 0.075%