INDEX
    Explanations

    proper nouns or names related to a specific individual

    phrases indicating support or endorsement

    New Auto-Interp
    Negative Logits
    ĸļ
    -0.75
    IFF
    -0.71
    uality
    -0.70
    PsyNetMessage
    -0.70
    ¬¼
    -0.69
    acan
    -0.69
    ARY
    -0.68
    acca
    -0.66
     juggling
    -0.66
    Joined
    -0.64
    POSITIVE LOGITS
    guard
    0.85
    lesi
    0.77
    ventus
    0.73
    port
    0.72
    nz
    0.71
    uploads
    0.71
    etheless
    0.70
    plate
    0.68
    restricted
    0.68
    los
    0.68
    Act Density 0.000%

    No Known Activations