INDEX
Explanations
references to paper products or materials
NEGATIVE LOGITS
Token    Logit
sb       -0.20
eva      -0.20
sin      -0.18
sa       -0.18
sf       -0.18
sing     -0.17
spb      -0.17
sla      -0.17
sy       -0.17
tics     -0.17
POSITIVE LOGITS
Token    Logit
clip     0.39
weight   0.35
backs    0.34
weights  0.31
trail    0.29
towel    0.28
board    0.27
towels   0.26
mill     0.26
work     0.26
Activations Density: 0.026%