Explanations

Massive objects in science

np_max-act-logits · gemini-2.0-flash

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.19.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

" empleado"  -0.08
"めて"  -0.07
".cc"  -0.07
"езжа"  -0.07
" obligation"  -0.07
"ཅ"  -0.07
"_ATTACHMENT"  -0.07
" integrated"  -0.07
" lecken"  -0.07
".Register"  -0.07

Positive Logits

" rivalry"  0.07
"绦"  0.07
"Computer"  0.07
"font"  0.07
" תוכ"  0.07
"互利"  0.07
" affili"  0.07
"TES"  0.07
" Privacy"  0.07
"nav"  0.07

Activations Density: 0.037%

No Known Activations