INDEX
    Explanations

    proper nouns related to various entities and people

    proper nouns, particularly names and entities

    New Auto-Interp
    Negative Logits
    arial
    -0.71
    )=(
    -0.68
    ional
    -0.66
    hy
    -0.66
    è¡
    -0.63
    DERR
    -0.61
    lisher
    -0.61
    hist
    -0.61
    otaur
    -0.61
    soDeliveryDate
    -0.60
    POSITIVE LOGITS
     citiz
    0.72
    ACTED
    0.65
    izont
    0.64
    vation
    0.64
    +.
    0.61
    ecause
    0.60
     cheated
    0.59
    caster
    0.59
    Reviewer
    0.59
    Os
    0.59
    Act Density 0.571%

    No Known Activations