INDEX
    Explanations

    attends to the token "Merci" from token phrases that include some form of punctuation or emoticon indicating a response

    New Auto-Interp
    Head Attr Weights
    0:0.16
    1:0.19
    2:0.10
    3:0.10
    4:0.10
    5:0.04
    6:0.10
    7:0.17
    Negative Logits
     متعلقه
    -0.46
    aarrggbb
    -0.44
    InjectAttribute
    -0.42
     wireType
    -0.41
    LookAnd
    -0.41
    !*\
    -0.40
    SharedDtor
    -0.40
     resourceCulture
    -0.40
    Geplaatst
    -0.38
    writeField
    -0.38
    POSITIVE LOGITS
    acamole
    0.29
     Massa
    0.28
     EoL
    0.28
    0.26
    ğunu
    0.26
    OTS
    0.26
    ocirc
    0.25
    consin
    0.25
    ElementException
    0.25
    нда
    0.25
    Act Density 0.070%

    No Known Activations