INDEX
Negative Logits
ACTION
-0.65
caution
-0.60
Scorp
-0.60
ARTICLE
-0.60
refrain
-0.59
lessly
-0.58
specificity
-0.58
DRAG
-0.58
regard
-0.56
amend
-0.56
POSITIVE LOGITS
thia
1.20
aptic
1.16
nen
1.12
olds
0.99
nis
0.93
ocent
0.92
kees
0.89
emies
0.89
esis
0.88
osure
0.87
Activations Density 0.026%