INDEX

Explanations

gender pay equality

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 unusually

-0.81

 czł

-0.81

 despe

-0.78

 terminado

-0.75

Hp

-0.75

 маршру

-0.74

ignty

-0.73

ổi

-0.73

 tous

-0.73

 máscara

-0.73

POSITIVE LOGITS

pay

1.48

 equal

1.36

 salary

1.35

 wage

1.35

equal

1.28

 disparity

1.26

Equal

1.25

salary

1.23

 Equal

1.23

 gender

1.15

Activations Density 0.011%