INDEX

Explanations

|

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 quand

-0.06

rnd

-0.06

 Translation

-0.06

 DialogResult

-0.06

 Hoch

-0.06

 painful

-0.06

 внимание

-0.06

 что

-0.06

 улучш

-0.06

Elim

-0.06

POSITIVE LOGITS

-machine

0.06

чается

0.06

 Token

0.06

tile

0.06

 weakening

0.06

ていた

0.06

_up

0.06

ore

0.06

 learning

0.06

��

0.06

Activations Density 0.265%

|

No Comments

No Known Activations