INDEX

Explanations

formal and informal language

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 تَ

0.59

 unsurprisingly

0.56

 কিছুক্ষণ

0.52

્યૂ

0.51

<unused2121>

0.49

 عَ

0.47

 രണ്ട്

0.47

<unused2173>

0.47

 Rodríguez

0.45

 اُ

0.45

POSITIVE LOGITS

 IMHO

0.63

 heretofore

0.58

 thru

0.55

!!!!

0.55

Etc

0.55

ie

0.53

 someplace

0.53

 THAT

0.52

 commensurate

0.51

!!!!!

0.50

Activations Density 0.004%