INDEX

Explanations

riddles, "what am i"

riddle-style prompts and answers in chat transcripts—especially the “What am I?” pattern and turn-boundary markers indicating user/model turns.

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

我就

0.37

 stratégies

0.36

!!!!!

0.36

ynı

0.35

 Hendrix

0.34

 эсте

0.34

 юриди

0.34

 специфи

0.34

 рей

0.34

！！！！

0.34

POSITIVE LOGITS

ﺪ

0.35

ി

0.32

 moderately

0.32

0.31

Reverse

0.30

namely

0.30

 clinging

0.30

 morn

0.29

 retract

0.29

 humiliation

0.29

Activations Density 0.123%