INDEX
Explanations
instances of contrasting conjunctions or phrases that indicate opposition
Negative Logits
eniable       -0.17
osaur         -0.16
uin           -0.15
erval         -0.15
Nos           -0.15
oreal         -0.15
anth          -0.14
oun           -0.14
/browser      -0.14
thankfully    -0.14
Positive Logits
beg           0.26
thems         0.22
who           0.21
who           0.20
shrugged      0.19
shr           0.18
oh            0.18
hey           0.17
sometimes     0.17
heck          0.16
Activations Density: 0.118%