INDEX

Explanations

joy and happiness

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

導入

-0.89

rous

-0.89

 tronic

-0.89

moder

-0.87

ug

-0.86

xii

-0.85

nima

-0.85

credentials

-0.83

 aziende

-0.83

 Memoriam

-0.82

POSITIVE LOGITS

 happiness

2.75

joy

2.38

 smiles

1.85

 laughter

1.80

 positivity

1.80

 positive

1.73

 peace

1.63

 love

1.60

 warmth

1.59

 good

1.54

Activations Density 0.041%