INDEX
Explanations
expressions related to agreement or unanimity of opinions
references to general agreement or shared viewpoints
New Auto-Interp
Negative Logits
hib
-0.85
gin
-0.80
chester
-0.77
strip
-0.77
ikarp
-0.76
thy
-0.73
vern
-0.71
undai
-0.70
duc
-0.70
gun
-0.70
POSITIVE LOGITS
consensus
1.02
unanimously
0.83
ensus
0.78
amongst
0.76
IFIED
0.75
itarian
0.74
opinion
0.72
among
0.71
ãĥ¼ãĥĨ
0.71
20439
0.71
Activations Density 0.026%