INDEX
Explanations
mentions of societal issues related to discrimination and human rights
New Auto-Interp
Negative Logits
0000000000000000
-0.66
lot
-0.66
NOW
-0.66
lodged
-0.66
HAEL
-0.65
%%
-0.65
soever
-0.64
debuted
-0.63
LOG
-0.63
ALSE
-0.62
POSITIVE LOGITS
lieu
1.48
effic
1.45
accordance
1.39
efficiency
1.38
spite
1.38
relation
1.37
conjunction
1.31
roads
1.28
clusions
1.27
ordinate
1.25
Activations Density 1.185%