Explanations
physical attributes or medical conditions
terms associated with healthcare, law enforcement, and societal norms
Negative Logits
Token           Logit
)=(             -0.55
istrate         -0.52
shapeshifter    -0.49
etimes          -0.49
minist          -0.49
earable         -0.48
addon           -0.48
Canaver         -0.47
enum            -0.46
archived        -0.45
Positive Logits
Token           Logit
or              0.81
etc             0.75
/.              0.60
and             0.59
/               0.56
&               0.52
_.              0.52
et              0.52
®,              0.51
*.              0.50
Activations Density 1.159%
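
For readers unfamiliar with the density figure, the sketch below shows one common way such a number could be computed. It assumes that "activations density" means the fraction of token positions at which the feature's activation is above zero; the page itself does not state the definition. The function name `activation_density`, the `threshold` parameter, and the sample values are all hypothetical and used only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: density = (number of active positions) / (total positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical activations for one feature over 8 token positions.
sample = [0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.0, 0.2]
density = activation_density(sample)
print(f"Activations Density {density:.3%}")
```

Under this assumed definition, a density of 1.159% would mean the feature fires on roughly 1 in 86 tokens of the sampled text.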