INDEX
Explanations
expressions of solidarity and support for various groups and causes
New Auto-Interp
Head Attr Weights
0:0.01
1:0.01
2:0.06
3:0.06
4:0.10
5:0.03
6:0.02
7:0.38
8:0.02
9:0.03
10:0.12
11:0.10
Negative Logits
puberty
-1.64
Xan
-1.62
doctor
-1.45
imester
-1.43
mental
-1.39
pills
-1.39
Reincarn
-1.34
metadata
-1.30
rollment
-1.29
kale
-1.29
POSITIVE LOGITS
plight
1.66
MpServer
1.62
solidarity
1.59
against
1.53
brothers
1.40
uncond
1.40
downt
1.39
persecuted
1.38
kindred
1.36
agg
1.35
Activations Density 0.002%