Explanations

grant

np_max-act · gemini-2.0-flash

terms related to intellectual property and its legal rights.

oai_token-act-pair · gpt-4o-mini Triggered by @xinyanhu8

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted
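
The configuration values above can be collected into a single record, which makes it easy to check that the hook name and the SAE path refer to the same layer. The following is a minimal sketch only: the `SaeConfig` class and its field names are hypothetical and are not part of any library; the values are copied from the listing above.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class SaeConfig:
    """Hypothetical container for the values shown on this page."""
    sae_path: str
    hook_name: str
    n_features: int
    dtype: str
    architecture: str
    context_size: int
    dataset: str

    @property
    def layer(self) -> int:
        # Extract the block index from a hook name like "blocks.11.hook_resid_post".
        match = re.fullmatch(r"blocks\.(\d+)\.hook_resid_post", self.hook_name)
        if match is None:
            raise ValueError(f"unexpected hook name: {self.hook_name!r}")
        return int(match.group(1))


config = SaeConfig(
    sae_path="andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1",
    hook_name="blocks.11.hook_resid_post",
    n_features=131_072,
    dtype="float32",
    architecture="standard",
    context_size=1_024,
    dataset="monology/pile-uncopyrighted",
)

# The layer parsed from the hook name should agree with the layer in the SAE path.
layer_matches_path = f"resid_post_layer_{config.layer}" in config.sae_path
print(config.layer, layer_matches_path)
```

Here the hook name and the path both point at layer 11, and the feature count 131,072 equals 2^17.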

Negative Logits

" Runs"       -0.08
" convers"    -0.07
" inte"       -0.07
".tif"        -0.07
" rico"       -0.07
" blowing"    -0.06
" picked"     -0.06
" Manager"    -0.06
"\tsave"      -0.06
" sanit"      -0.06

Positive Logits

" fikir"        0.07
"опрос"         0.06
"Outlet"        0.06
"getClass"      0.06
"Course"        0.06
" trovare"      0.06
"STRUCTOR"      0.06
" française"    0.06
" trục"         0.06
" bufferSize"   0.06

Activations Density 0.001%
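
To make the density figure concrete, here is a small sketch under one assumption: that activation density means the fraction of token positions on which the feature fires (activation above zero). The function name `activation_density` and the toy data are hypothetical and only illustrate the arithmetic; they are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumes density is defined as (active tokens) / (total tokens).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy data: the feature fires on 1 token out of 100,000.
toy_activations = [0.0] * 99_999 + [2.5]
density = activation_density(toy_activations)
print(f"{density:.3%}")
```

Under that assumption, a density of 0.001% corresponds to roughly one active token per 100,000 tokens.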

No Known Activations
