INDEX

Explanations

defining comparisons and states

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ﺮ

0.42

мым

0.41

пі

0.41

 पहुंचते

0.40

പ്പെട്ടു

0.40

 серд

0.39

 Dogg

0.39

 해당

0.39

咽

0.39

 सबसे

0.39

POSITIVE LOGITS

 perhaps

0.51

perhaps

0.50

 Perhaps

0.49

 motivations

0.49

 conceivably

0.48

 talvez

0.48

 desde

0.45

 geändert

0.45

 conformity

0.45

也許

0.45

Activations Density 0.010%