INDEX
    Explanations

    proper nouns and specific names

    definite articles, indicating a focus on specific entities or subjects

    New Auto-Interp
    Negative Logits
    ADE
    -0.74
    å·
    -0.71
    Ïī
    -0.66
    watching
    -0.65
    existent
    -0.64
    estern
    -0.63
    bane
    -0.63
     âĢº
    -0.62
    acet
    -0.62
    ãĥīãĥ©
    -0.62
    POSITIVE LOGITS
     remainder
    1.71
     rest
    1.54
     remaining
    1.51
     others
    1.46
     latter
    1.21
     youngest
    1.20
    oret
    1.17
     Others
    1.13
     oldest
    1.07
     other
    1.05
    Act Density 0.318%

    No Known Activations