INDEX
Explanations
dualities or comparisons between different entities
the phrase "on the other hand."
Negative Logits
Token            Logit
unfocusedRange   -0.60
burg             -0.56
regor            -0.55
Cyborg           -0.54
lessly           -0.52
Photographer     -0.52
ibur             -0.51
ordial           -0.49
uci              -0.49
Biology          -0.48
Positive Logits
Token            Logit
hand             1.34
side             1.21
hand             0.99
side             0.97
flank            0.89
hemisphere       0.88
axis             0.84
paw              0.84
extreme          0.82
sth              0.81
Activations Density 0.026%
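
The density figure above is most naturally read as the fraction of token positions on which the feature fires. The sketch below illustrates that reading under stated assumptions: the function name `activation_density`, the zero firing threshold, and the sample data are all hypothetical and are not taken from the dashboard or its tooling.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means share of positions with a non-zero activation;
    the dashboard's exact definition is not given in the source.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: the feature fires on 26 of 100,000 token positions.
sample = [0.0] * 99_974 + [1.5] * 26
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.026% means the feature is active on roughly 26 of every 100,000 tokens, which fits a feature tied to a narrow phrase such as "on the other hand."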