INDEX

Explanations

tri

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Giant

-0.07

:M

-0.06

_Name

-0.06

 eskort

-0.06

 почти

-0.06

regunta

-0.06

.memo

-0.06

۲۷

-0.06

.setUser

-0.06

 Astr

-0.06

POSITIVE LOGITS

 olduğundan

0.06

(always

0.06

 ordinances

0.06

 freshman

0.06

 {}));↵

0.06

 firefight

0.06

少し

0.06

 qualifies

0.06

 steadfast

0.06

 lawmakers

0.06

Activations Density 0.174%

tri

No Comments

No Known Activations

tri

No Comments

No Known Activations