INDEX
    Explanations

    proper nouns or mentions of specific entities, such as names of companies, organizations, or titles

    occurrences of the word "the"

    New Auto-Interp
    Negative Logits
     beforehand
    -0.82
    /"
    -0.76
    iod
    -0.73
    Ò
    -0.72
    omever
    -0.71
     thereby
    -0.70
    æ©
    -0.70
    SPONSORED
    -0.70
     without
    -0.67
     directly
    -0.67
    POSITIVE LOGITS
    resa
    1.51
    odore
    1.37
    oret
    1.31
     Latest
    1.07
    orem
    1.06
    ories
    1.04
    atre
    1.03
     Basics
    0.98
     easiest
    0.97
     latest
    0.94
    Act Density 0.192%

    No Known Activations