INDEX
Explanations
phrases related to willingness and negotiation
New Auto-Interp
Negative Logits
Compass
-0.93
////////////////////////////////
-0.77
Offline
-0.72
_.
-0.68
DAQ
-0.68
Cars
-0.66
tracks
-0.65
availability
-0.65
lights
-0.65
alia
-0.65
POSITIVE LOGITS
accept
1.32
sacrifice
1.32
tolerate
1.24
concede
1.24
admit
1.21
negotiate
1.18
embrace
1.15
compromise
1.15
cooperate
1.14
forgive
1.14
Activations Density 0.075%