INDEX

Explanations

breakdowns of complex topics

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 इन्होंने

0.79

<-

0.67

 折り

0.64

(!

0.63

classical

0.62

Appendix

0.62

viridis

0.62

আনুশকা

0.62

(!

0.60

Spotify

0.59

POSITIVE LOGITS

 weaknesses

0.82

 weakness

0.80

 misunder

0.78

 first

0.76

 loved

0.76

結局

0.76

 weakened

0.75

愛

0.74

 points

0.74

 First

0.73

Activations Density 0.130%