INDEX
    Explanations
    Negative Logits
    "::_('"          -0.66
    "deleteById"     -0.66
    " vĩnh"          -0.65
    " lenne"         -0.59
    "raszam"         -0.56
    "expandindo"     -0.55
    " continuant"    -0.54
    " razie"         -0.54
    " σήμερα"        -0.54
    " surla"         -0.53

    Positive Logits
    "When"            1.46
    " When"           1.44
    "Когда"           1.05
    "Whenever"        1.03
    " Когда"          1.02
    " Wanneer"        1.00
    " Lorsque"        0.99
    " Cuando"         0.99
    " Whenever"       0.98
    " Quando"         0.97
    Activation Density: 0.075%

    No Known Activations