INDEX
    Explanations

    phrases indicating a recommended course of action or a comparison between different approaches or states

    New Auto-Interp
    Negative Logits
     territo
    -1.00
     tew
    -1.00
     excu
    -0.99
     profi
    -0.96
     dises
    -0.96
     rafra
    -0.91
     abnorm
    -0.91
     hina
    -0.91
     maksi
    -0.91
     „,
    -0.90
    POSITIVE LOGITS
    собенности
    0.66
    фициальный
    0.63
     simply
    0.62
    lepiej
    0.62
     prostu
    0.61
    nowu
    0.60
     via
    0.59
    ypeł
    0.56
    фициаль
    0.56
    municipi
    0.56
    Act Density 0.233%

    No Known Activations