INDEX
Explanations
expressions of strong emotions, particularly frustration or distress
Head Attr Weights
0:0.04
1:0.09
2:0.13
3:0.07
4:0.07
5:0.11
6:0.17
7:0.03
8:0.10
9:0.04
10:0.04
11:0.06
Negative Logits
inclination   -1.49
exceptions    -1.49
circumstance  -1.43
slip          -1.43
contacts      -1.41
hers          -1.34
shrug         -1.33
follow        -1.31
splash        -1.29
curs          -1.29
Positive Logits
sbm                  1.85
eco                  1.71
iors                 1.66
sty                  1.64
tun                  1.62
aquin                1.57
ashtra               1.56
Reviewer             1.53
rir                  1.52
channelAvailability  1.52
Activations Density 0.000%