Explanations
mentions of the organization "Human Rights Watch"
Negative Logits
Token          Logit
chemotherapy   -0.75
welding        -0.72
fries          -0.70
ãĤª            -0.69
istically      -0.68
ãĥ´            -0.68
ettel          -0.67
microw         -0.67
weld           -0.66
reopen         -0.66
Positive Logits
Token          Logit
tower          1.19
dog            1.18
Watch          1.08
dogs           1.06
atcher         1.03
Watching       1.03
wat            0.86
Watch          0.84
atches         0.83
watch          0.83
Activations Density: 0.014%
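
Two of the negative-logit tokens, ãĤª and ãĥ´, look like encoding debris but are probably raw vocabulary strings from a byte-level BPE tokenizer. The sketch below assumes the model uses the GPT-2 style byte-to-character table, which this page does not state. Under that assumption it maps such strings back to readable text. The function names are illustrative and not part of any dashboard API.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2 style byte -> printable-character table (assumed tokenizer)."""
    # Bytes that are already printable keep their own code point.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    # Every remaining byte is shifted to a code point starting at 256.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


def decode_token(token):
    """Turn a raw byte-level vocabulary string into readable text."""
    char_to_byte = {ch: b for b, ch in bytes_to_unicode().items()}
    raw = bytes(char_to_byte[ch] for ch in token)
    # Partial multi-byte sequences are shown as U+FFFD instead of raising.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["\u00e3\u0124\u00aa", "\u00e3\u0125\u00b4", "dog"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

If the assumption holds, the two strings decode to the katakana characters オ and ヴ, and plain ASCII tokens such as dog pass through unchanged.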