Explanations

achieving final outcomes

Configuration

Prompts (Dashboard): 238,145 prompts, 512 tokens each

Dataset (Dashboard): lmsys + oasst1

Negative Logits

Token (quoted to preserve leading spaces), then logit value:

"耷"  0.41
" تمہیں"  0.41
" اكيد"  0.39
"Cabe"  0.38
"VALID"  0.38
" Esta"  0.37
" pensado"  0.37
"تما"  0.37
"そこ"  0.36
" olas"  0.36

Positive Logits

Token (quoted to preserve leading spaces), then logit value:

" accomplish"  0.53
" achieve"  0.48
" achieves"  0.46
" мог"  0.46
" weapons"  0.45
"뱃"  0.44
" accomplishes"  0.43
" accomplished"  0.43
" উল্লেখযোগ্য"  0.43
" achieved"  0.42

Activations Density 0.001%
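
To give a rough sense of scale for the density figure, the sketch below combines it with the prompt configuration listed above. It assumes, and this is an assumption not stated on the page, that density is the fraction of dashboard tokens on which the feature fires at all; the variable names are illustrative only.

```python
# Rough scale estimate for "Activations Density 0.001%".
# ASSUMPTION: density = fraction of all dashboard tokens with a nonzero
# activation. The page does not state how the figure is computed.

num_prompts = 238_145        # from "Prompts (Dashboard)"
tokens_per_prompt = 512      # from "Prompts (Dashboard)"
density_percent = 0.001      # from "Activations Density"

total_tokens = num_prompts * tokens_per_prompt
estimated_active_tokens = total_tokens * density_percent / 100

print(f"total tokens: {total_tokens:,}")
print(f"estimated active tokens: {estimated_active_tokens:,.0f}")
```

Under that assumption the feature would fire on roughly a thousand tokens out of about 122 million. Since the displayed density is rounded to one significant digit, this is an order-of-magnitude figure only.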