INDEX

Explanations

stating

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

finder

-0.06

firebase

-0.06

 نيز

-0.06

 Imagine

-0.06

 amnesty

-0.06

ember

-0.06

 نمای

-0.06

blind

-0.06

 nutrients

-0.06

 grey

-0.06

POSITIVE LOGITS

 stated

0.11

 stating

0.09

 Patel

0.08

 Ст

0.08

_Output

0.07

 addressing

0.07

 titled

0.07

 startIndex

0.07

ΜΑ

0.06

-st

0.06

Activations Density 0.010%

stating

No Comments

No Known Activations

stating

No Comments

No Known Activations