INDEX

Explanations

self and own concepts

The neuron detects words relating to self‐ownership or one’s own identity (e.g. “self,” “own,” “ownership,” “autologous,” etc.).

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

setCellStyle

-1.03

</u>

-0.88

àn

-0.87

mari

-0.87

</td>

-0.86

 trái

-0.85

uintptr

-0.85

 wspar

-0.84

饰演

-0.84

toHexString

-0.83

POSITIVE LOGITS

 self

1.68

self

1.62

own

1.37

 SELF

1.34

自

1.33

Selbst

1.25

Self

1.21

OWN

1.17

 vlastní

1.16

SELF

1.16

Activations Density 0.118%