INDEX
Explanations
repeated references to a specific alignment or comparison, often associated with audience reactions or emotional responses
New Auto-Interp
Head Attr Weights
0:0.20
1:0.06
2:0.10
3:0.07
4:0.13
5:0.08
6:0.06
7:0.04
8:0.07
9:0.03
10:0.07
11:0.04
Negative Logits
anmar
-1.33
processing
-1.25
cas
-1.23
asus
-1.20
ruary
-1.20
chnology
-1.17
batch
-1.16
casing
-1.15
ancial
-1.15
activ
-1.15
POSITIVE LOGITS
Dialogue
1.20
externalActionCode
1.16
Certainly
1.15
�
1.13
Whoever
1.12
Sean
1.10
Laughs
1.09
�
1.09
Journal
1.07
ONSORED
1.07
Activations Density 0.103%