INDEX

Explanations

in a role for years

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Negative Logits

Token          Logit
" baba"        0.45
" disgust"     0.43
" cursing"     0.43
"แปล"          0.42
" peur"        0.42
"暇"           0.41
" verkauft"    0.41
" брак"        0.41
" vendre"      0.40
"詛"           0.40

Positive Logits

Token             Logit
" overseeing"     0.59
" oversaw"        0.58
" oversees"       0.57
" oversee"        0.55
" responsible"    0.48
" overseen"       0.47
"responsible"     0.46
" envisioned"     0.44
" contributes"    0.44
" manages"        0.42

Activations Density 0.012%
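
To give a sense of scale, the dashboard figures can be combined into a rough count of activating tokens. This is a minimal sketch under one assumption not stated on the page: that "Activations Density" is the percentage of dashboard tokens on which the feature has a nonzero activation. The function and constant names are illustrative only.

```python
# Figures copied from the dashboard configuration above.
NUM_PROMPTS = 238_145
TOKENS_PER_PROMPT = 512
ACTIVATION_DENSITY_PERCENT = 0.012  # assumed: share of tokens with nonzero activation


def estimate_active_tokens(num_prompts, tokens_per_prompt, density_percent):
    """Return (total tokens, estimated number of tokens where the feature fires)."""
    total_tokens = num_prompts * tokens_per_prompt
    # Density is given as a percentage, so divide by 100 to get a fraction.
    active_tokens = round(total_tokens * density_percent / 100)
    return total_tokens, active_tokens


total, active = estimate_active_tokens(
    NUM_PROMPTS, TOKENS_PER_PROMPT, ACTIVATION_DENSITY_PERCENT
)
print(f"total tokens: {total:,}")
print(f"estimated activating tokens: {active:,}")
```

Under that assumption the feature fires on roughly one token in every 8,300, which fits a sparse feature tied to a narrow phrasing such as holding or overseeing a role.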