INDEX
Explanations
exclamatory expressions of surprise or amazement
exclamatory expressions of surprise or amazement
New Auto-Interp
Negative Logits
obligated
-0.68
independ
-0.64
embr
-0.62
externalToEVAOnly
-0.61
redress
-0.59
alternate
-0.59
contracted
-0.58
icipated
-0.58
fullest
-0.57
epend
-0.56
POSITIVE LOGITS
zers
1.35
!:
1.06
!
1.06
ww
1.04
wow
1.03
Wow
1.02
orld
1.02
pedia
1.01
!,
1.01
wow
1.00
Activations Density 0.019%