Explanations
references to medium-sized entities or classifications
Negative Logits
ment      -0.18
ides      -0.16
zym       -0.16
iaux      -0.15
anter     -0.15
adi       -0.15
anal      -0.15
actory    -0.14
isd       -0.14
radi      -0.14
Positive Logits
sized     0.41
-sized    0.41
Sized     0.31
-size     0.28
size      0.24
ship      0.23
-large    0.23
size      0.22
-length   0.22
-range    0.22
Activations Density: 0.019%