INDEX
Explanations
discussions related to political dissent and human rights issues
New Auto-Interp
Negative Logits
OGND
-0.65
rrggbb
-0.64
IsContent
-0.52
Spisak
-0.52
cordova
-0.52
isOk
-0.51
مزید
-0.50
rosario
-0.50
skim
-0.46
noDo
-0.46
POSITIVE LOGITS
виправивши
0.72
dared
0.57
Tikang
0.55
dares
0.55
DISE
0.52
courage
0.51
Dare
0.51
ragos
0.50
disobedience
0.50
jsonwebtoken
0.49
Activations Density 0.144%