Explanations

time/amount needed

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.19.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

"JSON"            -0.06
" Puzzle"         -0.06
"uddled"          -0.06
"یدی"             -0.06
" vile"           -0.06
" trivial"        -0.06
"architecture"    -0.06
" impressive"     -0.06
"atrix"           -0.06
" Straw"          -0.06

Positive Logits

" teardown"       0.07
" zájem"          0.07
" proceeds"       0.07
"chedule"         0.06
" schematic"      0.06
" rejuven"        0.06
" خوب"            0.06
"deadline"        0.06
" steer"          0.06
" запит"          0.06
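
Dashboards of this kind commonly derive the negative and positive logit lists by projecting a feature's decoder direction through the model's unembedding matrix and keeping the extremes at each end. The sketch below illustrates that idea on toy data. It is an assumption about the method, not something this page states, and `top_logit_effects`, `decoder_dir`, `unembed`, and `vocab` are hypothetical names with invented values.

```python
def top_logit_effects(decoder_dir, unembed, vocab, k=3):
    """Rank vocabulary tokens by the logit change a feature direction induces.

    decoder_dir: list[float], length d_model (hypothetical feature direction)
    unembed:     list[list[float]], d_model rows x vocab_size columns
    vocab:       list[str], token strings aligned with the unembed columns
    """
    # The logit effect for token j is the dot product of the direction
    # with column j of the unembedding matrix.
    effects = [
        sum(decoder_dir[i] * unembed[i][j] for i in range(len(decoder_dir)))
        for j in range(len(vocab))
    ]
    ranked = sorted(zip(vocab, effects), key=lambda pair: pair[1])
    negative = ranked[:k]        # most negative first
    positive = ranked[::-1][:k]  # most positive first
    return negative, positive


# Toy data, invented for illustration only (2-dimensional model, 4 tokens).
vocab = [" teardown", "JSON", " steer", " Puzzle"]
unembed = [
    [0.07, -0.06, 0.05, -0.04],
    [0.00, 0.00, 0.01, -0.01],
]
decoder_dir = [1.0, 1.0]

negative, positive = top_logit_effects(decoder_dir, unembed, vocab, k=2)
```

Leading spaces in the token strings are kept because they are part of the token, which is why the lists above quote each entry.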

Activations Density 0.041%
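
Activation density is read here as the fraction of tokens on which the feature fires (activation above zero). That reading is an assumption, since the page does not define the term. Under it, a minimal sketch of the calculation, followed by the arithmetic for what 0.041% means at the listed context size of 1,024 tokens; `activation_density` and the toy activations are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`."""
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Toy activations, invented for illustration: 1 firing token out of 8.
toy_density = activation_density([0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.0, 0.0])

# Reported density from the page, converted from percent to a fraction.
reported_density = 0.041 / 100
context_size = 1024

# Expected number of firing tokens in one full-length context.
expected_per_context = reported_density * context_size

# Average number of tokens between firings.
tokens_per_firing = 1 / reported_density
```

At the reported density this works out to roughly 0.42 firing tokens per 1,024-token context, or about one firing in every 2,440 tokens.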

No Known Activations