INDEX
    Explanations

    informal language involving places or organizations, like nicknames or abbreviations

    New Auto-Interp
    Negative Logits
    spring
    -0.80
    ERN
    -0.76
    ELL
    -0.72
    andise
    -0.71
    orage
    -0.69
    ebted
    -0.69
    ifies
    -0.69
    eers
    -0.68
    izations
    -0.68
    ifying
    -0.67
    POSITIVE LOGITS
    volent
    0.82
    vious
    0.80
    judicial
    0.77
    gress
    0.76
    phrine
    0.76
    AFTA
    0.72
    llor
    0.69
    mand
    0.69
    pless
    0.69
    boro
    0.67
    Act Density 0.027%

    No Known Activations