INDEX

Explanations

references to the term "horizon."

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

Juliushanhanhan/llama-3-8b-it-res/blocks.25.hook_resid_post

Features

65,536

Data Type

float32

Hook Name

blocks.25.hook_resid_post

Hook Layer

Architecture

gated

Context Size

1,024

Dataset

Juliushanhanhan/openwebtext-1b-llama3-tokenized-cxt-1024

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

erman

-0.20

een

-0.20

tle

-0.19

ermen

-0.18

eenth

-0.18

holm

-0.17

dom

-0.17

nik

-0.16

drawing

-0.16

ÙħØ§ÙĨÛĮ

-0.16

POSITIVE LOGITS

izons

0.24

izont

0.22

ìŀ¡

0.21

izon

0.19

/back

0.18

arium

0.18

izontally

0.17

izontal

0.17

iginal

0.16

line

0.16

Activations Density 0.013%

references to the term "horizon."

No Comments

No Known Activations