Explanations

say 'duration'

np_max-act-logits · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_23/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.23.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Negative Logits

 UIButton

-0.06

reiben

-0.06

 pencil

-0.06

 shape

-0.06

_CENTER

-0.06

Assembler

-0.06

shape

-0.06

 aşırı

-0.06

 savedInstanceState

-0.06

scope

-0.06

Positive Logits

.getB

0.06

.direct

0.06

отреб

0.06

 struggled

0.06

‐

0.06

(WIN

0.06

DOS

0.06

 André

0.06

 Messiah

0.06

GDP

0.06

Activations Density

0.033%

No Known Activations
