INDEX

Explanations

references to television shows and related media

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Top Features by Cosine Similarity

Comparing With LLAMA3.1-8B @ 28-llamascope-res-32k

Configuration

fnlp/Llama3_1-8B-Base-LXR-8x/Llama3_1-8B-Base-L28R-8x

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Features

32,768

Data Type

bfloat16

Hook Name

blocks.28.hook_resid_post

Hook Layer

Architecture

jumprelu

Context Size

1,024

Dataset

cerebras/SlimPajama-627B

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

adium

-0.15

iband

-0.14

 Starr

-0.14

-pad

-0.14

 */;↵

-0.14

 ÑĥÑģ

-0.14

Ã³m

-0.14

htag

-0.14

iances

-0.13

æ¢¨

-0.13

POSITIVE LOGITS

Ses

0.35

Ker

0.30

ker

0.27

 sesame

0.26

uppet

0.23

 puppet

0.23

 Bert

0.22

 Cookie

0.22

 Frag

0.21

SES

0.21

Activations Density 0.002%

references to television shows and related media

No Comments

No Known Activations

references to television shows and related media

No Comments

No Known Activations