INDEX
Explanations
instances of specific coded abbreviations
references to the Diagnostic and Statistical Manual of Mental Disorders (DSM)
New Auto-Interp
Negative Logits
swick
-0.86
============
-0.71
symmetry
-0.70
Reviewer
-0.68
ishment
-0.64
issance
-0.64
========
-0.64
ufact
-0.63
pains
-0.63
ished
-0.61
POSITIVE LOGITS
omething
1.13
IFF
0.96
OME
0.91
LR
0.87
yll
0.87
FW
0.86
DK
0.86
HI
0.85
CAP
0.85
III
0.85
Activations Density 0.035%