INDEX
Explanations
phrases related to willingness or openness
expressions of willingness or intent to take action
Negative Logits
Compass        -0.65
Sina           -0.65
Statistics     -0.64
availability   -0.62
iens           -0.62
Bound          -0.61
calling        -0.60
ndra           -0.60
ants           -0.60
Textures       -0.59
Positive Logits
sacrifice      1.28
concede        1.27
tolerate       1.22
compromise     1.17
accept         1.14
gamble         1.13
overlook       1.12
cooperate      1.12
forgive        1.04
sacrific       1.03
Activations Density: 0.108%