INDEX

Explanations

Numerical ranges

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 actors

-0.07

 cross

-0.07

 crosses

-0.07

lass

-0.06

 failure

-0.06

 matcher

-0.06

 Ground

-0.06

 reading

-0.06

 replay

-0.06

 regional

-0.06

POSITIVE LOGITS

해요

0.07

네요

0.07

 semiclass

0.07

股份有限公司

0.07

arParams

0.07

 PropertyInfo

0.06

_BO

0.06

*pi

0.06

кра

0.06

-liter

0.06

Activations Density 0.009%

Numerical ranges

No Comments

No Known Activations