INDEX
Explanations
differences or choices between two entities
contrasting ideas or themes
Negative Logits
uca          -0.86
ccording     -0.69
fundament    -0.68
¯            -0.67
attRot       -0.67
\'           -0.66
aez          -0.66
RELEASE      -0.66
.ãĢį         -0.65
ens          -0.65
Positive Logits
Conversely       0.76
abase            0.72
Saharan          0.70
versus           0.64
whereas          0.63
Or               0.60
Whereas          0.59
others           0.59
alternatively    0.59
while            0.58
Activations Density: 1.056%