INDEX

Explanations

circuit

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 liberal

-0.06

 τους

-0.06

 dikke

-0.06

:".

-0.06

 яв

-0.06

schools

-0.06

orn

-0.06

mw

-0.06

bg

-0.06

-*

-0.06

POSITIVE LOGITS

 감독

0.07

_AUTO

0.07

 cutoff

0.06

لق

0.06

matchCondition

0.06

Inactive

0.06

 metabolism

0.06

 metabolic

0.06

 mortgage

0.06

 wholesale

0.06

Activations Density 0.006%

circuit

No Comments

No Known Activations

circuit

No Comments

No Known Activations