INDEX

Explanations

dialogue or quotes that indicate someone's opinion or statement

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

Juliushanhanhan/llama-3-8b-it-res/blocks.25.hook_resid_post

Features

65,536

Data Type

float32

Hook Name

blocks.25.hook_resid_post

Hook Layer

Architecture

gated

Context Size

1,024

Dataset

Juliushanhanhan/openwebtext-1b-llama3-tokenized-cxt-1024

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Ease

-0.15

ÃĹ↵↵

-0.13

Î»Î¬

-0.13

 kred

-0.13

uro

-0.13

avo

-0.12

 ÑĤÐµÑĩ

-0.12

lie

-0.12

ØªÙĨ

-0.12

ander

-0.12

POSITIVE LOGITS

one

0.38

 said

0.22

ä¸Ģä¸ª

0.21

 má»Ļt

0.20

an

0.19

 eines

0.19

ä¸ĢåĢĭ

0.19

 longtime

0.19

 ÛĮÚ©ÛĮ

0.18

Activations Density 0.039%

dialogue or quotes that indicate someone's opinion or statement

No Comments

No Known Activations