Explanations
negative statements or denials
negations or expressions of absence
Negative Logits
Token     Logit
Js        -0.75
ean       -0.74
eals      -0.73
alde      -0.72
rs        -0.72
ents      -0.70
rn        -0.69
lish      -0.69
agents    -0.68
rand      -0.67
Positive Logits
Token         Logit
shortage      1.35
doubt         1.28
indication    1.18
reason        1.11
denying       1.06
guarantee     1.05
excuse        1.05
ambiguity     1.01
difference    0.97
timetable     0.96
Activations Density 0.074%