INDEX
Explanations
phrases related to agreement or majority opinion
phrases indicating agreement or general acceptance among groups
New Auto-Interp
Negative Logits
asus
-0.83
gin
-0.81
strip
-0.80
cer
-0.76
gun
-0.76
gins
-0.75
mers
-0.74
udi
-0.74
ikarp
-0.73
zona
-0.73
POSITIVE LOGITS
consensus
1.01
opinion
0.92
amongst
0.87
unanimously
0.84
among
0.82
ensus
0.79
ensical
0.78
opinions
0.77
atorial
0.75
acceptance
0.74
Activations Density 0.044%