INDEX
    Explanations

    different types or options

    New Auto-Interp
    Negative Logits
     wyłącznie
    0.36
     مذکور
    0.33
     desist
    0.32
     conseguenza
    0.32
     offending
    0.31
     offend
    0.31
     erneut
    0.30
     Rely
    0.30
     प्रतिदिन
    0.30
    反而
    0.29
    POSITIVE LOGITS
     różne
    0.40
    depending
    0.37
     versatile
    0.37
     typical
    0.36
     depending
    0.35
     ambitious
    0.34
    ليزية
    0.33
     options
    0.33
    简单
    0.32
     típico
    0.32
    Act Density 0.982%

    No Known Activations