INDEX
    Explanations

    references to organizations or institutions

    New Auto-Interp
    Negative Logits
     Hogarth
    -0.84
    |
    
    -0.84
    -0.82
     Eilish
    -0.79
     Roskov
    -0.76
    soldier
    -0.74
    mogorov
    -0.74
    ICom
    -0.73
    PreferredItem
    -0.72
    UrlResolution
    -0.72
    POSITIVE LOGITS
     centers
    2.00
     center
    1.95
     Center
    1.88
     Centre
    1.88
     centre
    1.83
     centres
    1.82
     Centers
    1.80
     CENTER
    1.71
    center
    1.68
     Centres
    1.66
    Act Density 0.046%

    No Known Activations