INDEX
    Explanations

    references to artistic expression and creative works

    Negative Logits
    "istrovství"     -0.15
    "dro"            -0.15
    " Protective"    -0.15
    " konkrét"       -0.14
    "rid"            -0.14
    "otch"           -0.14
    "nam"            -0.14
    "atic"           -0.14
    "ixin"           -0.14
    "iced"           -0.14
    Positive Logits
    " Dy"            0.19
    "aty"            0.18
    " Dys"           0.18
    "elow"           0.18
    " dys"           0.17
    "otyp"           0.17
    " dy"            0.17
    "ograf"          0.17
    "yn"             0.17
    "indy"           0.16
    Activation Density: 0.065%

    No Known Activations