INDEX

Explanations

class implementing

The neuron strongly activates on technical two‐word phrases where a modifier (like an adjective or preposition) pairs with a noun—e.g. “concrete class,” “extended interface,” “in vivo,” “ideal number,” “corn fields,” “another list.”

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

Läh

-1.08

ể

-1.02

 emplear

-1.02

ſte

-1.02

 hornear

-0.97

 fiabilidad

-0.96

 první

-0.95

vgili

-0.95

ėtų

-0.94

wym

-0.93

POSITIVE LOGITS

 also

1.34

 still

0.99

 normaux

0.95

hlander

0.92

 universo

0.92

 even

0.92

。

0.92

还是要

0.91

 faudrait

0.90

 difer

0.90

Activations Density 0.085%