INDEX
    Explanations

    Attends from "by" tokens to the token "when"

    Head Attr Weights
    0:0.11
    1:0.14
    2:0.12
    3:0.12
    4:0.12
    5:0.08
    6:0.11
    7:0.15
    Negative Logits
    ConstraintMaker     -0.38
     tartalomajánló     -0.36
    RegressionTest      -0.34
    CPtr                -0.29
    LElement            -0.28
    AutoresizingMask    -0.28
    verwijspagina       -0.27
     ModelExpression    -0.26
     missionaries       -0.26
    ResponseWriter      -0.26
    Positive Logits
    Искәрмәләр     0.24
    Cartney        0.23
    APPS           0.23
     صوتيه         0.23
    ối             0.23
    ptor           0.22
    lesi           0.22
    Palmar         0.22
     interested    0.21
     συμ           0.21
    Act Density: 0.252%

    No Known Activations