INDEX
    Explanations

    references to educational institutions, particularly high schools

    New Auto-Interp
    Negative Logits
    hind
    -0.08
    ushman
    -0.08
    oyer
    -0.07
    ieber
    -0.07
    ilyn
    -0.07
    idth
    -0.07
    еÑĢв
    -0.07
    ÑĭÑĤ
    -0.07
    dur
    -0.07
    ervo
    -0.07
    POSITIVE LOGITS
    985
    0.06
     ÑĢазви
    0.06
    owment
    0.06
    aku
    0.06
    357
    0.06
    atos
    0.06
     Jarvis
    0.05
    uten
    0.05
    rog
    0.05
    442
    0.05
    Act Density 0.009%

    No Known Activations