INDEX
Explanations
words related to agreement or consensus
phrases indicating consensus or agreement
New Auto-Interp
Negative Logits
ener
-0.79
udi
-0.78
gallery
-0.75
Laksh
-0.70
Lans
-0.66
predators
-0.65
Hutch
-0.64
Mae
-0.64
interrupted
-0.64
Uriel
-0.63
POSITIVE LOGITS
unanimously
0.87
terms
0.83
disagree
0.78
LY
0.76
agreeing
0.74
principle
0.71
hugs
0.69
consensus
0.67
heartedly
0.66
ä¹
0.65
Activations Density 0.154%