INDEX
Explanations
statements indicating doubt or negation
negations related to various assertions or claims
Negative Logits
çīĪ           -0.69
insula        -0.67
ãĤ¼ãĤ¦ãĤ¹     -0.65
¿½            -0.64
Rouge         -0.64
DIT           -0.63
Maiden        -0.62
sided         -0.62
rpm           -0.62
Globe         -0.62
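Three of the negative-logit tokens (çīĪ, ãĤ¼ãĤ¦ãĤ¹, ¿½) look like the display form used by GPT-2-style byte-level BPE tokenizers, where every raw byte is shown as a printable stand-in character. The page does not name the tokenizer, so this is an assumption. Under it, a minimal Python sketch of the inverse mapping recovers the underlying bytes; the function names are illustrative and not part of any library.

```python
def byte_decoder():
    """Build the inverse of the GPT-2 style byte-to-character display table.

    Assumption: the tokens on this page use that table; the page itself
    does not say which tokenizer produced them.
    """
    # Bytes that are displayed as themselves.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    table = {chr(b): b for b in printable}
    # Every other byte is displayed as chr(256 + n), in increasing byte order.
    offset = 0
    for b in range(256):
        if b not in printable:
            table[chr(256 + offset)] = b
            offset += 1
    return table


DECODER = byte_decoder()


def token_to_bytes(token):
    """Map a displayed token string back to its raw bytes."""
    return bytes(DECODER[ch] for ch in token)


def token_to_text(token):
    """Decode the raw bytes as UTF-8, replacing any incomplete sequences."""
    return token_to_bytes(token).decode("utf-8", errors="replace")


for tok in ["çīĪ", "ãĤ¼ãĤ¦ãĤ¹", "¿½"]:
    print(repr(tok), token_to_bytes(tok), token_to_text(tok))
```

If the assumption holds, çīĪ is the character 版 and ãĤ¼ãĤ¦ãĤ¹ is ゼウス. The token ¿½ is the two bytes BF BD, which are the tail of a UTF-8 replacement character rather than a complete character on their own.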
Positive Logits
necessarily   1.07
adequately    1.04
sufficiently  0.96
really        0.91
icably        0.90
bother        0.87
icable        0.87
really        0.86
bluff         0.86
ional         0.84
Activations Density 0.221%
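The density figure is not defined on the page. A common reading, assumed here, is the fraction of sampled token positions on which the feature has a nonzero activation; 0.221% would then mean roughly one position in 450. A minimal sketch of that calculation, with a hypothetical helper name and toy data:

```python
def activation_density(activations, threshold=0.0):
    """Fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled positions on which
    the feature fires; the page does not state its definition.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 1000 token positions, 2 of them active.
toy = [0.0] * 998 + [1.3, 0.4]
density = activation_density(toy)
print(f"{density:.3%}")
```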