INDEX

Explanations

pronouns (they, them, she)

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

und

0.58

approximately

0.55

极

0.53

 latitude

0.51

 subscribe

0.50

 फाइ

0.50

 scale

0.49

 roughly

0.49

極

0.49

 مغ

0.48

POSITIVE LOGITS

 Them

0.92

 THEM

0.92

 THEY

0.82

 તેણી

0.82

 Gender

0.76

เธอ

0.73

 देम

0.73

 Femin

0.73

性别

0.72

 Она

0.72

Activations Density 0.264%