INDEX
Explanations
mentions of community members or citizens in a sociopolitical context
New Auto-Interp
Head Attr Weights
0:0.08
1:0.08
2:0.07
3:0.09
4:0.09
5:0.08
6:0.07
7:0.07
8:0.08
9:0.08
10:0.08
11:0.08
Negative Logits
prus
-3.28
racuse
-2.87
oenix
-2.84
kefeller
-2.83
uana
-2.83
utonium
-2.81
envy
-2.73
zai
-2.71
leen
-2.71
Sov
-2.67
POSITIVE LOGITS
Hitch
2.93
FSA
2.93
Cable
2.92
Cage
2.88
Sphere
2.87
Bra
2.87
Bun
2.83
Breath
2.60
Eddie
2.56
Connection
2.54
Activations Density 0.000%