INDEX

Explanations

Technical/scientific contexts

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

png

-0.07

 Monk

-0.07

PST

-0.07

 بعض

-0.07

牿

-0.06

⺫

-0.06

隨著

-0.06

行業

-0.06

简直就是

-0.06

ของเรา

-0.06

POSITIVE LOGITS

ffer

0.07

Gew

0.07

�

0.06

toupper

0.06

ﻤ

0.06

 Children

0.06

bcm

0.06

ederland

0.06

ĕ

0.06

 lastName

0.06

Activations Density 0.573%

Technical/scientific contexts

No Comments

No Known Activations

Technical/scientific contexts

No Comments

No Known Activations