INDEX
Explanations
mentions of HR (human resources) departments or organizations
references to human rights topics
Negative Logits
âĸ¬      -0.73
cart     -0.70
Albion   -0.66
forth    -0.65
Homo     -0.63
coni     -0.62
Tycoon   -0.61
kick     -0.60
eers     -0.60
slow     -0.59
Positive Logits
RR       0.96
DP       0.92
adish    0.92
LIN      0.90
anging   0.89
OME      0.87
RI       0.86
OUGH     0.86
UN       0.84
ANGE     0.83
Activations Density 0.015%