INDEX
    Explanations

    references to duration or time, particularly the word "long."

    New Auto-Interp
    Negative Logits
    illard
    -0.18
    uyết
    -0.14
    cosa
    -0.14
    mares
    -0.14
    ekil
    -0.14
    uiltin
    -0.14
    infeld
    -0.14
    ละ
    -0.13
     Recorder
    -0.13
    ilden
    -0.13
    POSITIVE LOGITS
     while
    0.36
     long
    0.28
    while
    0.25
     WHILE
    0.24
    _while
    0.24
    gg
    0.24
     LONG
    0.24
     ways
    0.24
    ,long
    0.23
     While
    0.22
    Act Density 0.015%

    No Known Activations