INDEX
    Explanations

    attends to "they," "and," "it," "you," and "are" from subsequent tokens that follow

    New Auto-Interp
    Head Attr Weights
    0:0.16
    1:0.17
    2:0.12
    3:0.12
    4:0.11
    5:0.03
    6:0.10
    7:0.15
    Negative Logits
     gainera
    -0.31
    MessageOf
    -0.30
     JpaRepository
    -0.30
     deere
    -0.30
    aparte
    -0.29
     ExecuteAsync
    -0.29
    الدراسه
    -0.29
     plafond
    -0.28
    crisy
    -0.27
    ISTRATION
    -0.27
    POSITIVE LOGITS
     transfieras
    0.33
    KATH
    0.28
     noDo
    0.28
    rrggbb
    0.27
    WriteBarrier
    0.27
    WriteAttribute
    0.26
    WebVitals
    0.26
    LookAnd
    0.26
    клопе
    0.24
     specchio
    0.24
    Act Density 7.828%

    No Known Activations