INDEX

Explanations

Hotel rooms and suites

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

(make

-0.07

	atomic

-0.07

 Almost

-0.07

draulic

-0.07

ABOUT

-0.06

 הרפואי

-0.06

 communist

-0.06

.getSelectionModel

-0.06

unk

-0.06

 volunteering

-0.06

POSITIVE LOGITS

 channels

0.08

BOT

0.07

.property

0.07

花纹

0.07

集成

0.07

RIA

0.06

 melody

0.06

轿车

0.06

_CHANNELS

0.06

宠

0.06

Activations Density 0.014%

Hotel rooms and suites

No Comments

No Known Activations

Hotel rooms and suites

No Comments

No Known Activations