INDEX

Explanations

/

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 photographers

-0.08

:result

-0.07

 осіб

-0.06

Tesla

-0.06

 LSTM

-0.06

 Countdown

-0.06

StatusCode

-0.06

 shepherd

-0.06

 XCTAssertEqual

-0.06

 reservoir

-0.06

POSITIVE LOGITS

rom

0.07

تل

0.06

 Unified

0.06

 tattoos

0.06

rip

0.06

Ve

0.06

_UNDER

0.06

/auth

0.06

Va

0.06

得

0.06

Activations Density 0.012%

/

No Comments

No Known Activations