INDEX
Explanations
statements where there is agreement or consensus
statements of consensus or agreement
New Auto-Interp
Negative Logits
oufl
-0.69
Ascension
-0.65
ADS
-0.64
illus
-0.64
zh
-0.64
DAQ
-0.63
Advent
-0.63
ãĤ¹
-0.62
ember
-0.62
dimension
-0.61
POSITIVE LOGITS
agrees
1.08
agree
0.99
agree
0.91
reement
0.85
ilibrium
0.84
terday
0.83
ajor
0.81
agreeing
0.80
reements
0.79
agreed
0.78
Activations Density 0.003%