INDEX

Explanations

bile ducts

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

zza

-0.07

 adjective

-0.07

 astro

-0.07

луг

-0.06

-lg

-0.06

一页

-0.06

.dictionary

-0.06

 sinus

-0.06

िजल

-0.06

 человечес

-0.06

POSITIVE LOGITS

idding

0.07

ково

0.07

 junction

0.07

 Kimberly

0.07

icester

0.07

ensively

0.06

 Grey

0.06

 rallied

0.06

nee

0.06

 Preston

0.06

Activations Density 0.022%

bile ducts

No Comments

No Known Activations