INDEX
Explanations
emotional expressions and reactions in dialogue
Head Attr Weights
0:0.04
1:0.02
2:0.06
3:0.12
4:0.07
5:0.07
6:0.02
7:0.07
8:0.38
9:0.02
10:0.03
11:0.05
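
The weights above can be summarized with a short script. This is a minimal sketch that assumes each `index:weight` pair is an attention-head index and its attribution weight; the names `head_weights` and `dominant_head` are hypothetical and are not part of the dashboard.

```python
# Head attribution weights copied from the list above (head index -> weight).
# Assumption: each "index:weight" pair is an attention head and its weight.
head_weights = {
    0: 0.04, 1: 0.02, 2: 0.06, 3: 0.12,
    4: 0.07, 5: 0.07, 6: 0.02, 7: 0.07,
    8: 0.38, 9: 0.02, 10: 0.03, 11: 0.05,
}


def dominant_head(weights):
    """Return the (head, weight) pair with the largest weight."""
    return max(weights.items(), key=lambda kv: kv[1])


total = sum(head_weights.values())
top_head, top_weight = dominant_head(head_weights)
share = top_weight / total  # fraction of the listed total carried by the top head

print(f"total weight: {total:.2f}")
print(f"top head: {top_head} (weight {top_weight:.2f}, share {share:.0%})")
```

Head 8 carries 0.38 of the listed total of 0.95, about 40%. The total falling short of 1.00 is presumably due to rounding in the displayed values.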
Negative Logits
Philly         -2.67
Philadelphia   -2.56
Byrne          -2.34
CBS            -2.32
Semin          -2.30
Buckley        -2.28
Dodd           -2.28
federally      -2.28
McKenna        -2.27
Delaware       -2.25
Positive Logits
『   5.59
【   4.96
──   4.54
「   4.45
「   4.08
】   3.96
�   3.86
』   3.75
Takeru   3.65
sama   3.57
Activations Density 0.292%