INDEX

Explanations

words starting with Whe

The neuron activates on occurrences of the letter sequence “whe,” flagging tokens that begin with that substring (e.g. “Wheaton,” “Wheatstone,” “wheezing,” etc.).

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

坂

-0.79

 enjoy

-0.76

 sağlay

-0.74

ހ

-0.73

 производ

-0.73

 tega

-0.73

telek

-0.71

strict

-0.71

 enjoyment

-0.69

 Singles

-0.68

POSITIVE LOGITS

Whe

1.73

Whe

1.45

whe

1.31

whe

1.21

WHE

1.19

WHE

1.09

 whee

0.98

 Wheel

0.90

 wheel

0.82

 whet

0.79

Activations Density 0.016%