INDEX
Explanations
phrases or concepts related to contrasting or comparing two things
phrases that present contrasting viewpoints using "one hand" and "the other hand."
Negative Logits
zag      -0.81
dr       -0.77
nz       -0.75
zl       -0.72
ré       -0.72
ral      -0.72
MJ       -0.69
ns       -0.67
agus     -0.67
brook    -0.66
Positive Logits
behalf          0.68
reconcil        0.67
naïve           0.64
applaud         0.59
pret            0.59
rejoice         0.59
coasts          0.58
representing    0.58
depict          0.57
theless         0.57
Activations Density: 0.037%