INDEX
    Explanations

    editor's notes within articles

    possessive forms indicating ownership or editorial attributions

    Negative Logits
    Token            Logit
    "ĪĴ"             -0.79
    "ansas"          -0.75
    "vironment"      -0.72
    "facts"          -0.68
    "wikipedia"      -0.67
    "imedia"         -0.67
    "esm"            -0.66
    "USD"            -0.66
    "flix"           -0.66
    "quished"        -0.64

    Positive Logits
    Token            Logit
    " own"            0.83
    " inability"      0.80
    " remorse"        0.77
    " grasp"          0.70
    " insistence"     0.70
    " Guild"          0.70
    " ability"        0.69
    " penchant"       0.68
    " daughter"       0.68
    " Wife"           0.68
    Activation Density: 0.094%

    No Known Activations