INDEX
    Explanations

    attends to "to" from functions or phrases that are related to the token "would."

    New Auto-Interp
    Head Attr Weights
    0:0.13
    1:0.12
    2:0.10
    3:0.05
    4:0.06
    5:0.02
    6:0.23
    7:0.24
    Negative Logits
    AndEndTag
    -0.41
     كومونز
    -0.37
    IonicModule
    -0.35
    (;;)
    -0.34
     cardíaca
    -0.34
     invokingState
    -0.33
     translateY
    -0.33
     Haller
    -0.32
    AddField
    -0.32
     unknownFields
    -0.32
    POSITIVE LOGITS
    </h1>
    0.40
    umba
    0.35
     похо
    0.32
    coste
    0.31
    IENTE
    0.31
    WriteLiteral
    0.31
    )_/¯
    0.31
    minecraft
    0.31
     Parti
    0.31
    0.30
    Act Density 0.049%

    No Known Activations