INDEX
Explanations
the phrases where someone expresses their opinion or belief
phrases indicating expressions of personal opinion or assertion
New Auto-Interp
Negative Logits
mes
-0.74
govtrack
-0.64
ukong
-0.63
awar
-0.61
decom
-0.60
leans
-0.60
arra
-0.60
ahime
-0.59
Lenin
-0.59
uga
-0.56
POSITIVE LOGITS
confidently
0.92
anecd
0.77
quist
0.76
ħĭ
0.74
idate
0.74
vividly
0.68
Attribution
0.66
unequivocally
0.66
rist
0.65
003
0.64
Activations Density 0.095%