INDEX
    Explanations

    emotional expressions and reactions in dialogue

    Head Attr Weights
    0:0.04
    1:0.02
    2:0.06
    3:0.12
    4:0.07
    5:0.07
    6:0.02
    7:0.07
    8:0.38
    9:0.02
    10:0.03
    11:0.05
    Negative Logits
     Philly  -2.67
     Philadelphia  -2.56
     Byrne  -2.34
     CBS  -2.32
     Semin  -2.30
     Buckley  -2.28
     Dodd  -2.28
     federally  -2.28
     McKenna  -2.27
     Delaware  -2.25
    Positive Logits
    5.59
    4.96
    ──
    4.54
    4.45
    4.08
    3.96
    3.86
    3.75
     Takeru  3.65
    sama  3.57
    Act Density 0.292%

    No Known Activations