INDEX
Explanations
terms indicating a strong majority opinion or consensus
phrases indicating strong majority opinions or sentiments
New Auto-Interp
Negative Logits
tein
-0.82
href
-0.80
Afee
-0.80
ogy
-0.73
rug
-0.67
ht
-0.65
Encyclopedia
-0.64
Extrem
-0.63
Keeper
-0.63
agate
-0.63
POSITIVE LOGITS
overwhelmingly
0.99
unanimously
0.86
whelming
0.80
unanimous
0.79
avour
0.70
reliant
0.69
overwhelming
0.69
majority
0.69
dominated
0.68
!/
0.68
Activations Density 0.005%