INDEX

Explanations

expressions related to identity and cultural significance

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

Juliushanhanhan/llama-3-8b-it-res/blocks.25.hook_resid_post

Features

65,536

Data Type

float32

Hook Name

blocks.25.hook_resid_post

Hook Layer

Architecture

gated

Context Size

1,024

Dataset

Juliushanhanhan/openwebtext-1b-llama3-tokenized-cxt-1024

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Sesso

-0.14

riter

-0.14

 gratuites

-0.14

è¶£

-0.14

edly

-0.14

 Bender

-0.13

onium

-0.13

quan

-0.13

orra

-0.13

oth

-0.13

POSITIVE LOGITS

ÙħØ¯

0.16

 Ð·Ð°Ð²Ð¸ÑģÐ¸Ð¼

0.15

ÑĤÐ°Ð¶

0.15

ruit

0.15

obil

0.14

uvo

0.14

ovÃ¡no

0.14

 createState

0.14

eing

0.14

unable

0.14

Activations Density 0.009%

expressions related to identity and cultural significance

No Comments

No Known Activations

expressions related to identity and cultural significance

No Comments

No Known Activations