INDEX
    Explanations

    Genuineness

    New Auto-Interp
    Negative Logits
     za
    -0.06
     heavy
    -0.06
     per
    -0.06
    (box
    -0.06
    μφ
    -0.06
     Opt
    -0.06
     optimal
    -0.06
     širo
    -0.06
     invoking
    -0.06
    γχ
    -0.06
    POSITIVE LOGITS
    разд
    0.07
    ,body
    0.07
     Pais
    0.06
    network
    0.06
     déf
    0.06
    liğin
    0.06
    resultSet
    0.06
     Clinic
    0.06
     listen
    0.06
    _Interface
    0.06
    Act Density 0.112%

    No Known Activations