Explanations

floating point numbers, code

np_max-act-logits · gemini-2.0-flash

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.19.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

" sunset"     -0.07
" Failure"    -0.07
" miniature"  -0.07
".datas"      -0.07
" dosage"     -0.07
"Professor"   -0.07
"Least"       -0.07
"Ranges"      -0.07
" fairness"   -0.07
"Ai"          -0.07

Positive Logits

"	Test"    0.07
"_was"     0.06
"很有"     0.06
"Bank"     0.06
"posting"  0.06
"=='"      0.06
"UK"       0.06
"建議"     0.06
"莽"       0.06
"吁"       0.06

Activations Density: 0.002%

No Known Activations
