INDEX
    Explanations

    references to locations, specifically pertaining to schools or institutions

    New Auto-Interp
    Negative Logits
    ove
    -0.18
    лаж
    -0.15
    anka
    -0.15
    OW
    -0.15
    @student
    -0.15
    ibr
    -0.15
    retty
    -0.14
    ower
    -0.14
    aan
    -0.14
    anguage
    -0.14
    POSITIVE LOGITS
    yle
    0.17
    á»ķ
    0.17
    elah
    0.16
    YLES
    0.16
    onz
    0.15
    itag
    0.15
    çı
    0.15
     Brands
    0.15
    etic
    0.15
    ommen
    0.15
    Act Density 0.029%

    No Known Activations