Explanations

feelings, health

Configuration

Prompts (Dashboard): 16,384 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

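Negative Logits

Token (quoted to show leading spaces)    Logit
" happy"                                 -1.01
"happy"                                  -1.01
"Efq"                                    -0.98
" happiest"                              -0.96
" happier"                               -0.95
"HAPPY"                                  -0.90
"Happy"                                  -0.90
" Happy"                                 -0.88
" nahilalakip"                           -0.86
" HAPPY"                                 -0.85

Positive Logits

Token (quoted to show leading spaces)    Logit
"HideFlags"                              0.47
"ям"                                     0.44
"ScopeManager"                           0.44
" متعلقه"                                0.44
"ret"                                    0.42
"dum"                                    0.40
"рек"                                    0.40
" lombok"                                0.39
" with"                                  0.39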

Activations Density 0.057%
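
To give a rough sense of scale for the density figure, the sketch below converts it into an approximate token count. It assumes the density is the fraction of dashboard tokens on which the feature has a nonzero activation, and that it was measured over the dashboard prompt set listed under Configuration (16,384 prompts of 128 tokens each). Neither assumption is stated on this page, and the variable names are illustrative only.

```python
# Rough estimate only. Assumption: "Activations Density" is the share of
# dashboard tokens with a nonzero activation, measured over the
# 16,384 x 128 token dashboard prompt set.
PROMPTS = 16_384
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.057

total_tokens = PROMPTS * TOKENS_PER_PROMPT
active_tokens = round(total_tokens * DENSITY_PERCENT / 100)

print(f"total tokens:  {total_tokens:,}")
print(f"active tokens: {active_tokens:,} (approx.)")
```

Under those assumptions the feature would fire on roughly 1,200 of about 2.1 million tokens.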