INDEX
    Explanations

    attributes related to book design and quality

    Negative Logits
    "åĪĴ"      -0.16
    "oren"     -0.15
    "bsd"      -0.15
    "èĺ"       -0.14
    "orer"     -0.14
    "opup"     -0.14
    "AINED"    -0.13
    "å¼Ł"      -0.13
    "αιν"      -0.13
    "ninger"   -0.13
    Positive Logits
    " binding"    0.39
    " bound"      0.37
    "binding"     0.34
    " Binding"    0.33
    " bind"       0.32
    "-bound"      0.31
    " bindings"   0.31
    "bound"       0.31
    "-binding"    0.30
    "bind"        0.30
    Activation Density: 0.061%

    No Known Activations