INDEX

Explanations

Code special characters

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

寓意

-0.07

ONE

-0.07

茹

-0.07

 went

-0.07

面对面

-0.07

.element

-0.07

室内

-0.07

浦

-0.06

 gone

-0.06

عبد

-0.06

POSITIVE LOGITS

 vời

0.07

 décou

0.07

 hảo

0.07

 frec

0.07

 Funktion

0.07

直辖市

0.07

 cardio

0.07

Глав

0.07

cao

0.07

 escrit

0.07

Activations Density 0.001%

Code special characters

No Comments

No Known Activations

Code special characters

No Comments

No Known Activations