INDEX

Explanations

code examples, see

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

va

-0.07

 пациент

-0.07

max

-0.07

 left

-0.06

高涨

-0.06

 expected

-0.06

oud

-0.06

 volt

-0.06

 המקומי

-0.06

 Leia

-0.06

POSITIVE LOGITS

_finish

0.07

局限

0.07

'>

0.07

 цена

0.07

.Constraint

0.07

('?

0.07

 applyMiddleware

0.07

formedURLException

0.06

';'

0.06

_building

0.06

Activations Density 0.162%

code examples, see

No Comments

No Known Activations