INDEX

Explanations

certifications or doctors

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

坞

-0.06

ῑ

-0.06

瞿

-0.06

 מבח

-0.06

 Netz

-0.06

黄石

-0.06

 בבית

-0.06

痴

-0.06

itbart

-0.06

achine

-0.06

POSITIVE LOGITS

 architecture

0.08

极大地

0.07

风险

0.07

("-",

0.07

 thuận

0.07

fixed

0.07

 attributable

0.07

global

0.07

\Contracts

0.07

	writer

0.06

Activations Density 0.001%

certifications or doctors

No Comments

No Known Activations

certifications or doctors

No Comments

No Known Activations