    Explanations

    superlatives or comparisons of degree, such as "most" or "more."

    Negative Logits
    Token            Logit
    "æ©"             -0.90
    "rompt"          -0.88
    "heid"           -0.83
    "pload"          -0.82
    " Films"         -0.79
    "oak"            -0.78
    "instead"        -0.76
    "eto"            -0.74
    "undle"          -0.74
    "ategories"      -0.74
    Positive Logits
    Token            Logit
    " important"     1.29
    " powerful"      1.15
    " obvious"       1.13
    "likely"         1.12
    " prominent"     1.10
    " interesting"   1.10
    " plausible"     1.10
    " likely"        1.09
    " efficient"     1.09
    " profitable"    1.08
    Activation Density: 10.217%

    No Known Activations