INDEX
Explanations
opposite relationships or positions
terms related to oppositional or contrasting elements
New Auto-Interp
Negative Logits
ebook
-0.95
lished
-0.90
ourning
-0.86
urrent
-0.84
zinski
-0.82
ule
-0.81
ULT
-0.80
brance
-0.78
ourn
-0.78
ulic
-0.77
POSITIVE LOGITS
sides
1.00
directions
0.95
direction
0.87
sexes
0.84
side
0.81
oppos
0.79
opposite
0.78
minded
0.77
corners
0.77
poles
0.76
Activations Density 0.015%