INDEX

Explanations

per

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

гля

-0.50

Personensuche

-0.50

ьаж

-0.50

AxisAlignment

-0.49

PPA

-0.48

ñor

-0.47

enak

-0.47

 CURIAM

-0.46

ArrowToggle

-0.46

 especiais

-0.45

POSITIVE LOGITS

/******/

0.62

Jereo

0.52

xodo

0.52

ⓧ

0.51

ist

0.50

colazione

0.49

oxid

0.48

 désolés

0.47

 ozone

0.46

oxidation

0.46

Activations Density 0.008%