INDEX
    Explanations

    references to specific events or actions occurring over time

    New Auto-Interp
    Negative Logits
     juſ
    -0.59
     themſelves
    -0.54
     Monfieur
    -0.53
     Conſ
    -0.52
     незавершена
    -0.50
     quæ
    -0.50
     itſelf
    -0.50
    AndEndTag
    -0.50
     himſelf
    -0.49
     myſelf
    -0.49
    POSITIVE LOGITS
     again
    1.04
    Again
    0.88
    again
    0.86
     Again
    0.84
     AGAIN
    0.76
     опять
    0.75
     novamente
    0.71
     opět
    0.71
     nuevamente
    0.69
     igjen
    0.68
    Act Density 0.485%

    No Known Activations