INDEX

Explanations

puppets

np_max-act-logits · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_23/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.23.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

 etiquette    -0.08
'::           -0.07
 wxString     -0.07
Disposable    -0.07
 แบบ          -0.07
δει           -0.07
@@            -0.07
 спос         -0.07
_swap         -0.07
Kitchen       -0.07

Positive Logits

ipay          0.06
 класс        0.06
-Pacific      0.06
 Annex        0.06
estro         0.06
 immortal     0.06
ryn           0.06
 college      0.06
 randomness   0.06
िलत           0.06

Activations Density: 0.030%

puppets

No Comments

No Known Activations
