INDEX
Explanations
instances of negativity or negative connotations in the context of social interactions or mental states
Head Attr Weights
0:0.08
1:0.08
2:0.06
3:0.03
4:0.13
5:0.13
6:0.09
7:0.08
8:0.05
9:0.05
10:0.10
11:0.06
Negative Logits
Malays     -1.32
licence    -1.28
renamed    -1.23
nes        -1.22
ARS        -1.21
variants   -1.15
Update     -1.15
Les        -1.14
USSR       -1.14
defunct    -1.13
Positive Logits
emonic     1.60
xiety      1.40
cknow      1.38
amara      1.35
empath     1.33
soothing   1.31
��         1.27
ileged     1.26
nurturing  1.26
Volunte    1.23
Activations Density 0.419%