Explanations
references to protests and demonstrations
Negative Logits
aucune       -0.56
numberWith   -0.55
alway        -0.53
altrett      -0.53
Voi          -0.53
incidence    -0.53
aucun        -0.53
aucune       -0.52
netinet      -0.52
måned        -0.51
Positive Logits
protest      0.96
protests     0.89
ent          0.86
protest      0.85
Protest      0.78
protested    0.75
providedIn   0.75
entre        0.75
ckeye        0.71
Protest      0.71
Activations Density: 0.083%