
Explanations

Addresses and identifiers (np_max-act · gemini-2.0-flash)

Technical terminology related to quantitative finance and statistical modeling (oai_token-act-pair · gpt-4o-mini, triggered by @xinyanhu8)


Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted


Negative Logits

"Dirty"  -0.07
" breathe"  -0.06
" tiger"  -0.06
" Microsoft"  -0.06
" governments"  -0.06
"ived"  -0.06
" дослідження"  -0.06
"ultipart"  -0.06
" Pleasant"  -0.06
" بأن"  -0.06

Positive Logits

"<Entity"  0.07
"ıda"  0.06
"[J"  0.06
"'|"  0.06
"owej"  0.06
"스의"  0.06
"�"  0.06
"(slot"  0.06
"ulario"  0.06
"只"  0.06

Activations Density: 0.002%
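To put the density figure in perspective, the short sketch below converts it into an expected number of active tokens per context window. It assumes that "Activations Density" means the fraction of sampled tokens on which the feature fires; the dashboard itself does not define the term, so treat this reading as an assumption. The variable names are illustrative only.

```python
# Assumption: "Activations Density" is the fraction of sampled tokens
# on which the feature has a non-zero activation.
density_percent = 0.002  # from the dashboard
context_size = 1024      # from the Configuration section

# Convert the percentage to a plain fraction.
density = density_percent / 100.0

# Expected number of tokens per 1,024-token context on which the feature fires.
expected_active_per_context = density * context_size

# Average number of tokens seen per activation.
tokens_per_activation = 1.0 / density

print(f"expected active tokens per context: {expected_active_per_context:.5f}")
print(f"roughly one activation every {tokens_per_activation:,.0f} tokens")
```

Under that assumption the feature fires on about 0.02 tokens per context, that is, roughly once every 50,000 tokens, which is in keeping with the very small logit magnitudes and the absence of recorded activations on this page.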


No Known Activations