INDEX

Explanations

aggregate

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 слу

-0.07

ti

-0.07

 well

-0.06

 strict

-0.06

ências

-0.06

.Warning

-0.06

.ci

-0.06

 currentTime

-0.06

nil

-0.06

.NoArgsConstructor

-0.06

POSITIVE LOGITS

 aggregate

0.13

 aggregates

0.13

 aggregation

0.12

 Aggregate

0.12

 aggregated

0.12

Aggregate

0.11

 aggregator

0.11

aggregate

0.10

 aggreg

0.10

greg

0.09

Activations Density 0.006%

aggregate

No Comments

No Known Activations

aggregate

No Comments

No Known Activations