INDEX

Explanations

references to pledges, promises, and commitments

oai_token-act-pair · gpt-4o-mini Triggered by @bot

Top Features by Cosine Similarity

Comparing With LLAMA3.1-8B @ 28-llamascope-res-32k

Configuration

fnlp/Llama3_1-8B-Base-LXR-8x/Llama3_1-8B-Base-L28R-8x

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Features

32,768

Data Type

bfloat16

Hook Name

blocks.28.hook_resid_post

Hook Layer

Architecture

jumprelu

Context Size

1,024

Dataset

cerebras/SlimPajama-627B

Activation Function

relu
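The configuration above lists both "jumprelu" (Architecture) and "relu" (Activation Function). As a minimal sketch of how the two differ, assuming the standard definitions: ReLU zeroes out non-positive pre-activations, while JumpReLU zeroes out everything at or below a threshold. The threshold `theta` and the input values below are hypothetical and are not taken from this SAE.

```python
def relu(z: float) -> float:
    """Standard ReLU: keep positive pre-activations, zero out the rest."""
    return z if z > 0.0 else 0.0


def jumprelu(z: float, theta: float) -> float:
    """JumpReLU: keep z unchanged only when it exceeds the threshold theta."""
    return z if z > theta else 0.0


# Hypothetical pre-activations and threshold, chosen only for illustration.
pre_acts = [-0.5, 0.2, 0.9]
theta = 0.4

relu_out = [relu(z) for z in pre_acts]
jumprelu_out = [jumprelu(z, theta) for z in pre_acts]

print(relu_out)      # [0.0, 0.2, 0.9]
print(jumprelu_out)  # [0.0, 0.0, 0.9]
```

With `theta = 0.0` the two functions coincide, so ReLU can be read as the zero-threshold case of JumpReLU.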

Negative Logits

orz

-0.16

بط

-0.16

vidia

-0.15

quo

-0.15

 geile

-0.15

اسطة

-0.15

unkt

-0.14

Tcp

-0.14

 सल

-0.13

oucher

-0.13

Positive Logits

 promise

0.92

 promises

0.84

 Promise

0.77

 commitments

0.71

 commitment

0.70

promise

0.70

 pledge

0.69

 promised

0.69

Promise

0.67

 pledges

0.64

Activations Density 0.394%
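As a rough reading of the density figure, the sketch below converts it into an approximate token count. It assumes two things that the page does not state: that density means the fraction of tokens on which the feature activates, and that it was measured over the dashboard prompts listed in the configuration (24,576 prompts of 128 tokens each).

```python
# Values copied from the dashboard configuration and the density line.
prompts = 24_576
tokens_per_prompt = 128
density_percent = 0.394

# Assumption: density = (tokens where the feature fires) / (all dashboard tokens).
total_tokens = prompts * tokens_per_prompt
approx_active_tokens = round(total_tokens * density_percent / 100)

print(total_tokens)          # 3145728
print(approx_active_tokens)  # 12394
```

Under those assumptions the feature would fire on roughly 12,000 of about 3.1 million tokens, consistent with a sparse feature tied to a narrow set of words.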
