INDEX

Explanations

so followed by adjective

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 نب

0.44

 Syndicate

0.42

 syndicate

0.40

 أو

0.39

MDP

0.38

ίσ

0.38

ंप

0.38

τική

0.38

 करती

0.37

^{*}(\

0.37

POSITIVE LOGITS

nnnn

0.55

เลย

0.50

Surprisingly

0.48

prisingly

0.47

 awfully

0.46

Lovely

0.46

Interestingly

0.46

 einiger

0.46

value

0.45

 insidious

0.45

Activations Density 0.017%