INDEX

Explanations

specific tokens after certain words

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Saudi

0.45

Grunge

0.43

quat

0.40

kra

0.39

 goomba

0.39

Kant

0.38

 khách

0.38

kses

0.38

❤❤

0.38

zu

0.38

POSITIVE LOGITS

 hydro

0.43

 workable

0.42

 folk

0.38

 policewomen

0.38

 Rangers

0.37

 Pelican

0.37

 Troubleshooting

0.36

 rangers

0.36

 admirably

0.35

 fraction

0.34

Activations Density 0.001%