INDEX

Explanations

Research papers

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 vested

-0.07

 출시

-0.06

drink

-0.06

 Beit

-0.06

 Comparative

-0.06

 alot

-0.06

-gun

-0.06

 joys

-0.06

_io

-0.06

PHONE

-0.06

POSITIVE LOGITS

-coded

0.08

gambar

0.07

 nour

0.07

Susp

0.07

 elasticity

0.07

 spirits

0.06

 sunglasses

0.06

 prem

0.06

 Moder

0.06

detail

0.06

Activations Density 0.027%

Research papers

No Comments

No Known Activations

Research papers

No Comments

No Known Activations