INDEX

Explanations

actions and descriptors

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 वर्गी

0.41

 revocation

0.38

姿勢

0.37

 మార

0.36

존

0.35

 quitting

0.35

 قاب

0.34

⋮

0.34

 防止

0.33

 greeting

0.33

POSITIVE LOGITS

 mentally

0.44

 physically

0.43

 liberate

0.41

 regra

0.41

put

0.38

 gently

0.38

 somehow

0.37

 rendere

0.37

 lovingly

0.37

 artificially

0.36

Activations Density 0.220%