INDEX
Explanations
repeated references to being a part of something, emphasizing inclusion or participation
New Auto-Interp
Head Attr Weights
0:0.01
1:0.01
2:0.07
3:0.06
4:0.13
5:0.02
6:0.03
7:0.41
8:0.03
9:0.04
10:0.05
11:0.07
Negative Logits
opened
-1.73
BuyableInstoreAndOnline
-1.69
rha
-1.65
entle
-1.65
urances
-1.63
ratulations
-1.57
lease
-1.55
reens
-1.54
qus
-1.53
pload
-1.49
POSITIVE LOGITS
00200000
1.79
shaping
1.68
dramas
1.60
drama
1.51
Rebels
1.49
culture
1.46
troubles
1.45
Incident
1.45
Gutierrez
1.44
disturbances
1.44
Activations Density 0.001%