INDEX
Explanations
phrases related to emotions and feelings
Head Attr Weights
0:0.13
1:0.11
2:0.04
3:0.03
4:0.07
5:0.15
6:0.29
7:0.01
8:0.05
9:0.03
10:0.03
11:0.01
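
The head attribution weights above are listed as `head:weight` pairs. As a minimal sketch, the snippet below parses that listing and ranks the heads by weight. The helper names `parse_head_weights` and `rank_heads` are hypothetical and not part of any tool that produced this page; the values are copied verbatim from the listing.

```python
# Head attribution weights copied from the listing above ("head:weight" per line).
RAW = """0:0.13
1:0.11
2:0.04
3:0.03
4:0.07
5:0.15
6:0.29
7:0.01
8:0.05
9:0.03
10:0.03
11:0.01"""


def parse_head_weights(text):
    """Parse 'head:weight' lines into a dict mapping head index to weight."""
    weights = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        head, _, value = line.partition(":")
        weights[int(head)] = float(value)
    return weights


def rank_heads(weights):
    """Return (head, weight) pairs sorted from largest to smallest weight."""
    return sorted(weights.items(), key=lambda kv: kv[1], reverse=True)


weights = parse_head_weights(RAW)
ranked = rank_heads(weights)
total = sum(weights.values())

top_head, top_weight = ranked[0]
print(f"top head: {top_head} (weight {top_weight})")
print(f"sum of listed weights: {total:.2f}")
```

On this listing, head 6 carries the largest weight (0.29), followed by head 5 (0.15).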
Negative Logits
parach              -1.42
[&                  -1.39
MT                  -1.39
WARE                -1.38
VERTIS              -1.28
Manit               -1.28
medic               -1.26
ograp               -1.25
warnings            -1.23
..................  -1.23
Positive Logits
��                  1.75
�                   1.59
itely               1.48
Rivera              1.48
oya                 1.46
�                   1.46
arity               1.45
ess                 1.43
rive                1.41
cffffcc             1.41
Activations Density 0.076%