Explanations

structural words at the beginning of sentences or paragraphs in academic papers

oai_token-act-pair · gemini-2.0-flash

Configuration

fnlp/Llama-Scope-R1-Distill/400M-Slimpajama-400M-OpenR1-Math-220k/L17R

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

Hzfinfdu/SlimPajama-3B and open-r1/OpenR1-Math-220k

Features

32,768

Data Type

float32

Hook Name

blocks.17.hook_resid_post

Architecture

jumprelu

Context Size

1,024

Dataset

cerebras/SlimPajama-627B
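The "jumprelu" architecture listed above refers to a JumpReLU activation: a feature's pre-activation passes through unchanged only when it exceeds that feature's threshold, and is zeroed otherwise. The following is a minimal sketch under stated assumptions, not the code used to train this SAE. The function name `jumprelu` and the toy threshold values are hypothetical, and only four features are shown rather than the 32,768 listed in the configuration.

```python
from typing import List


def jumprelu(pre_acts: List[float], thresholds: List[float]) -> List[float]:
    """Apply a JumpReLU elementwise.

    A pre-activation z is kept as-is when z > threshold, otherwise it
    becomes 0.0. Thresholds are per-feature; the values used below are
    made up for illustration.
    """
    if len(pre_acts) != len(thresholds):
        raise ValueError("pre_acts and thresholds must have the same length")
    return [z if z > t else 0.0 for z, t in zip(pre_acts, thresholds)]


# Toy example with four hypothetical features.
pre = [0.2, 1.5, -0.3, 0.9]
thr = [0.5, 0.5, 0.5, 1.0]
print(jumprelu(pre, thr))
```

Unlike a plain ReLU, a value such as 0.9 is dropped here because it sits below its threshold of 1.0, even though it is positive.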

Negative Logits

geries

-0.07

Hao

-0.07

antar

-0.07

 다운받기

-0.06

eb

-0.06

ses

-0.06

bable

-0.06

sert

-0.06

bris

-0.06

 파일첨부

-0.06

Positive Logits

orem

0.12

amp

0.09

oretical

0.07

iming

0.06

igh

0.06

odor

0.06

œ

0.06

ory

0.06

notated

0.06

ories

0.06

Activations Density 0.656%

No Known Activations