INDEX
Explanations
words related to shelves
references to shelves or shelf space
New Auto-Interp
Negative Logits
qua
-0.75
ebus
-0.69
went
-0.68
nel
-0.68
Rounds
-0.63
Ascend
-0.62
alez
-0.61
Hybrid
-0.61
Intercept
-0.61
IENT
-0.59
POSITIVE LOGITS
shelf
1.28
shelves
1.19
sheet
0.95
tops
0.82
challeng
0.82
cloth
0.81
stairs
0.79
wip
0.79
ength
0.79
store
0.78
Activations Density 0.009%