Explanations

numbers

np_max-act · gemini-2.0-flash

Configuration: andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.7.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

 handicap

-0.07

oltage

-0.07

松

-0.06

 waves

-0.06

wc

-0.06

 begged

-0.06

Mac

-0.06

	return

-0.06

Working

-0.06

heels

-0.06

Positive Logits

 mainAxisAlignment

0.07

 straightforward

0.07

INIT

0.06

:invoke

0.06

 chance

0.06

-thinking

0.06

rtl

0.06

senha

0.06

ीम

0.06

.ACCESS

0.06

Activations Density: 0.052%

No Known Activations