INDEX
Explanations
phrases indicating agreement or consent
instances of consensus or agreement
Negative Logits
oufl        -0.77
vas         -0.70
crow        -0.68
Tycoon      -0.67
esa         -0.66
Blooming    -0.65
agin        -0.65
asus        -0.64
glands      -0.63
omial       -0.63
Positive Logits
unanimously  1.00
agreement    0.80
reements     0.78
ipeg         0.77
agreeing     0.76
agre         0.74
reement      0.74
ettle        0.73
agree        0.73
rences       0.73
Activations Density 0.038%