INDEX
Explanations
attributes related to book design and quality
New Auto-Interp
Negative Logits
åĪĴ
-0.16
oren
-0.15
bsd
-0.15
èĺ
-0.14
orer
-0.14
opup
-0.14
AINED
-0.13
å¼Ł
-0.13
αιν
-0.13
ninger
-0.13
POSITIVE LOGITS
binding
0.39
bound
0.37
binding
0.34
Binding
0.33
bind
0.32
-bound
0.31
bindings
0.31
bound
0.31
-binding
0.30
bind
0.30
Activations Density 0.061%