INDEX
    Explanations

    specific nouns and proper nouns related to institutions and locations

    New Auto-Interp
    Negative Logits
    itial
    -0.15
    ernel
    -0.14
    ovnÄĽ
    -0.14
     sic
    -0.14
    _RADIO
    -0.14
    itag
    -0.14
    aptcha
    -0.14
    vro
    -0.14
     materi
    -0.14
    olley
    -0.14
    POSITIVE LOGITS
    phis
    0.16
    osal
    0.15
    ноп
    0.15
    ARI
    0.15
    еком
    0.14
    ager
    0.14
    zel
    0.14
    utation
    0.14
    ää
    0.14
    .parser
    0.13
    Act Density 0.039%

    No Known Activations