INDEX

Explanations

phrases that indicate the conclusion or ending of thoughts

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

Juliushanhanhan/llama-3-8b-it-res/blocks.25.hook_resid_post

Features

65,536

Data Type

float32

Hook Name

blocks.25.hook_resid_post

Hook Layer

Architecture

gated

Context Size

1,024

Dataset

Juliushanhanhan/openwebtext-1b-llama3-tokenized-cxt-1024

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

oss

-0.17

le

-0.15

ich

-0.15

otal

-0.15

<<<<<<<

-0.15

sel

-0.14

iche

-0.14

leme

-0.14

och

-0.14

à¹īà¸²à¸Ĭ

-0.14

POSITIVE LOGITS

icina

0.18

-the

0.17

abbo

0.17

ushima

0.16

 thumb

0.16

Ð°Ð½Ð¾Ð²

0.16

course

0.16

afone

0.16

 affairs

0.16

 course

0.16

Activations Density 0.052%

phrases that indicate the conclusion or ending of thoughts

No Comments

No Known Activations