INDEX
Explanations
negative statements or denials
phrases indicating the absence or lack of something
New Auto-Interp
Negative Logits
parts
-0.96
redients
-0.90
agents
-0.89
ents
-0.89
devices
-0.88
anners
-0.88
items
-0.87
projects
-0.85
Actions
-0.85
eals
-0.83
POSITIVE LOGITS
indication
1.41
shortage
1.28
doubt
1.15
guarantee
1.13
suggestion
1.03
evidence
1.03
reason
1.03
way
1.02
requirement
1.01
denying
0.99
Activations Density 0.058%