INDEX

Explanations

related to concepts or actions

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 INTERACTIONS

0.50

 OPERATIONS

0.48

 operaciones

0.46

 unicorns

0.46

荞

0.46

 Agricultura

0.44

 Ocak

0.44

ग्रियों

0.44

CONCLUSIONS

0.43

 récupérer

0.43

POSITIVE LOGITS

 говорил

0.41

ness

0.40

松

0.39

 resented

0.39

Hul

0.39

amn

0.38

stoß

0.38

 Ting

0.38

 misdemeanor

0.38

енти

0.37

Activations Density 0.001%