INDEX

Explanations

into (np_max-act · gemini-2.0-flash)


Configuration: andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted


Negative Logits

Logit  Token (quoted)
-0.07  " aerobic"
-0.06  "compound"
-0.06  " Album"
-0.06  " camino"
-0.06  "áno"
-0.06  " Record"
-0.06  " disease"
-0.06  "anner"
-0.06  " painter"
-0.06  " daemon"

Positive Logits

Logit  Token (quoted)
0.07  "ประส"
0.06  " rozdíl"
0.06  " underst"
0.06  ".neo"
0.06  " valign"
0.06  "(\"
0.06  "	function"
0.06  "ไข"
0.06  "textTheme"
0.06  " spectators"

Activations Density: 0.016%
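
For scale, a density of 0.016% corresponds to roughly one active token position in every 6,250. That reading assumes density means the fraction of token positions on which the feature's activation is above zero, which is a common convention but is not confirmed by this page. A minimal sketch of that calculation follows; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of positions where the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 2 active positions out of 12,500 tokens.
example = [0.0] * 12_498 + [0.8, 1.3]
density = activation_density(example)
print(f"{density:.3%}")  # 0.016%
```

Under this definition, a feature with such a low density fires very rarely, which is consistent with no activations being listed for it here.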

into

No Known Activations
