INDEX

Explanations

own governing documents

Configuration

Prompts (Dashboard): 238,145 prompts, 512 tokens each

Dataset (Dashboard): lmsys + oasst1

Negative Logits

因此  0.44
२  0.43
total  0.43
lardır  0.42
ศัพท์  0.41
 العملية  0.41
ש  0.41
Consequently  0.41
ش  0.40
รางวัล  0.40

Positive Logits

 Defense  0.56
 gamma  0.52
 respectable  0.51
 subordinate  0.51
 Quincy  0.50
Ryu  0.50
 tripped  0.50
 Proud  0.49
 elected  0.49
 Gamma  0.49

Activations Density 0.003%