INDEX

Explanations

underlying conditions or structures

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Backed

0.38

 overpowered

0.38

декс

0.38

गराज

0.37

िफिकेशन

0.37

Returned

0.37

 काबिल

0.37

 समानता

0.36

秒

0.36

 маркетин

0.36

POSITIVE LOGITS

 mode

0.63

 animating

0.55

mode

0.54

 modes

0.54

 epistem

0.54

 putative

0.54

 epist

0.54

 imaginative

0.54

modes

0.53

 discursive

0.53

Activations Density 0.041%