INDEX

Explanations

Reinforcement Learning from Human Feedback

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

itale

0.64

 peacekeeping

0.62

lique

0.62

स्टेबल

0.62

teach

0.61

寘

0.61

LIM

0.61

Teach

0.60

leftarrow

0.60

 teach

0.59

POSITIVE LOGITS

 पर्

0.63

ਾਸ

0.61

wyp

0.61

면

0.60

などで

0.59

ナット

0.59

 rendered

0.59

 सृष्टि

0.58

pů

0.58

 paginate

0.58

Activations Density 0.063%