INDEX
Explanations
phrases that express contradictions or contrasting ideas
Negative Logits
Cosponsors  -0.76
chool       -0.74
ogun        -0.72
itsch       -0.70
lass        -0.69
©¶æ         -0.63
psc         -0.60
abase       -0.60
unes        -0.59
cients      -0.58
Positive Logits
satisfaction     0.92
disappointment   0.89
excitement       0.89
sadness          0.88
certainty        0.88
acknowledgement  0.87
acknowledgment   0.85
happiness        0.83
surprises        0.83
degradation      0.82
Activations Density: 0.311%