INDEX
Explanations
pairs of contrasting terms
the conjunction "and" used in various contexts
New Auto-Interp
Negative Logits
uel
-0.79
³
-0.75
odore
-0.71
į
-0.71
iane
-0.70
Ĥª
-0.69
IJ
-0.68
Į
-0.68
º
-0.68
eks
-0.66
POSITIVE LOGITS
ours
0.72
halves
0.72
sexes
0.70
autobiography
0.66
ebook
0.62
nam
0.62
equally
0.60
multiplication
0.60
genders
0.59
admit
0.59
Activations Density 0.148%