INDEX

Explanations

cynicism, skepticism, and distrust

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

馋

0.46

 QnrB

0.45

cznej

0.44

ه

0.43

凿

0.42

実

0.42

тные

0.41

др

0.41

ற

0.41

୍ର

0.40

POSITIVE LOGITS

Wu

0.47

ulo

0.45

 gimana

0.45

 Fresno

0.45

 extortion

0.44

 liste

0.43

 satirical

0.43

 Dallas

0.43

 প্রবন্ধ

0.42

Activations Density 0.000%