    Explanations

    instances of the word "while."

    Negative Logits
    Token           Logit
    " SAK"          -0.85
    " Lyt"          -0.82
    " Roch"         -0.80
    " Bassett"      -0.80
    " ECS"          -0.79
    "sett"          -0.77
    " initState"    -0.77
    "çek"           -0.76
    " ALF"          -0.76
    "Sak"           -0.75
    Positive Logits
    Token           Logit
    " while"        1.79
    "while"         1.73
    " WHILE"        1.63
    " While"        1.61
    "WHILE"         1.55
    " whilst"       1.52
    "While"         1.50
    " Whilst"       1.38
    "mientras"      1.37
    " Enquanto"     1.36
    Activation Density: 0.073%

    No Known Activations