INDEX
Explanations
references to the working conditions and challenges faced by workers in specific industries
New Auto-Interp
Negative Logits
avax
-0.18
olle
-0.15
oom
-0.15
_SIMPLE
-0.15
quential
-0.15
edd
-0.15
Yo
-0.14
ANGLES
-0.14
hoo
-0.14
xac
-0.14
POSITIVE LOGITS
ụ
0.24
izu
0.20
Ig
0.19
azor
0.19
á»ĭ
0.19
ozor
0.19
Mb
0.18
á»
0.18
Ok
0.17
Ez
0.17
Activations Density 0.040%