INDEX

Explanations

expressions of love and enjoyment

The neuron strongly activates on personal and interrogative pronouns (e.g. I, my, she, he, who, what), i.e. words that refer directly to people or pose questions.

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

znej

-0.94

hasMany

-0.91

 грамм

-0.87

 бампер

-0.86

 desirable

-0.86

φαλ

-0.85

ικα

-0.82

lden

-0.82

redirects

-0.80

 schön

-0.79

POSITIVE LOGITS

 love

5.94

 loves

4.31

 LOVE

3.70

love

3.69

Love

3.61

 Love

3.56

 loved

3.41

LOVE

3.34

爱

2.81

 любовь

2.78

Activations Density 0.114%