INDEX
Explanations
phrases related to contrasting or comparing different aspects or entities
phrases that contrast different subjects or points of view
Negative Logits
obin     -0.65
hess     -0.58
zing     -0.55
ahl      -0.55
Roose    -0.55
<@       -0.55
'.       -0.53
zn       -0.53
GET      -0.51
Slash    -0.51
POSITIVE LOGITS
fared         0.73
flourished    0.72
outper        0.70
accommod      0.68
disclaim      0.68
thri          0.67
derives       0.67
teaches       0.66
pmwiki        0.66
lishes        0.65
Activations Density 0.292%