INDEX

Explanations

understanding, assessment, and specific characteristics

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

0.58

َرْ

0.48

ious

0.46

 الرحيم

0.45

ological

0.44

0.43

试

0.43

ساوي

0.43

0.42

0.41

POSITIVE LOGITS

↵↵↵↵↵↵↵↵

0.58

SpawnEntry

0.56

¿?

0.54

 brunâtre

0.50

<unused345>

0.49

 てる

0.49

↵↵↵↵↵↵↵↵↵↵

0.49

↵↵↵↵↵↵↵↵↵↵↵↵↵↵

0.49

<unused407>

0.49

 維尼

0.49

Activations Density 0.000%