INDEX
    Explanations

    gerunds and actions related to maintaining stability and balance in systems

    New Auto-Interp
    Negative Logits
     ویکی‌پدی
    -0.65
    Diwedd
    -0.63
     propOrder
    -0.61
    Personensuche
    -0.60
    TagMode
    -0.59
    intenant
    -0.59
     ainfi
    -0.55
     increí
    -0.55
     înc
    -0.53
    verwijspagina
    -0.52
    POSITIVE LOGITS
     while
    0.50
    しながら
    0.45
    的同时
    0.44
    while
    0.43
     '\\;'
    0.42
     simultaneously
    0.42
    同时
    0.40
     sambil
    0.40
    しつつ
    0.39
     dabei
    0.38
    Act Density 0.453%

    No Known Activations