INDEX
Explanations
terms related to opposition or contrasting viewpoints
phrases involving 'back-and-forth' interactions or debates
New Auto-Interp
Negative Logits
ulhu
-0.77
ãģ®éŃĶ
-0.70
ãĤ¼ãĤ¦ãĤ¹
-0.69
Phi
-0.69
DOI
-0.69
Volcano
-0.69
Parenthood
-0.64
improv
-0.62
iaz
-0.61
QC
-0.61
POSITIVE LOGITS
end
1.14
backed
1.06
eyed
1.04
to
1.02
side
1.01
arching
1.00
based
0.99
office
0.98
eye
0.97
front
0.97
Activations Density 0.081%