INDEX
Explanations
instances of suffering or health-related distress
New Auto-Interp
Head Attr Weights
0:0.06
1:0.01
2:0.25
3:0.07
4:0.10
5:0.03
6:0.04
7:0.03
8:0.13
9:0.04
10:0.09
11:0.09
Negative Logits
Revolution
-1.32
Scarlet
-1.31
WAR
-1.28
Beaut
-1.27
Merry
-1.27
Victory
-1.25
ivan
-1.24
Kirin
-1.23
Competitive
-1.22
Waterloo
-1.22
POSITIVE LOGITS
NetMessage
1.92
oğ
1.59
glers
1.54
tradem
1.51
destro
1.50
livest
1.49
rys
1.47
ciating
1.47
require
1.45
Interstitial
1.44
Activations Density 0.049%