Explanations
elements related to accountability and legal processes regarding human rights
Negative Logits
anton         -0.17
transitions   -0.15
ruh           -0.15
ibe           -0.15
varargin      -0.15
ogle          -0.14
mitter        -0.14
ê·ł           -0.14
Ñıн           -0.14
transient     -0.14
Positive Logits
Gaza          0.29
Gaz           0.24
aid           0.24
Freedom       0.23
fl            0.23
humanitarian  0.22
Block         0.21
Palestine     0.21
block         0.20
Free          0.20
Activations Density: 0.003%