INDEX

Model

gemma-2-9b-it

Layer #

Steering Hook

blocks.20.hook_resid_pre

Steering Strength

53.75

Uploader

bot-neuronpedia

Created At

2/15/2025 1:06:43 AM
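The hook name and strength listed above can be read as a recipe for activation steering. The following is a minimal sketch, assuming the common additive form (residual stream plus strength times a direction vector); the page does not state how the strength is applied. The helper names `hook_layer` and `steer`, and the toy 4-dimensional vectors, are hypothetical and for illustration only.

```python
from typing import List

# Values copied from the page.
STEERING_HOOK = "blocks.20.hook_resid_pre"
STEERING_STRENGTH = 53.75


def hook_layer(hook_name: str) -> int:
    """Extract the block index from a hook name shaped like 'blocks.<n>.<hook>'."""
    parts = hook_name.split(".")
    if len(parts) < 3 or parts[0] != "blocks":
        raise ValueError(f"unexpected hook name: {hook_name!r}")
    return int(parts[1])


def steer(resid: List[float], direction: List[float],
          strength: float = STEERING_STRENGTH) -> List[float]:
    """Return resid + strength * direction (ASSUMED additive steering)."""
    if len(resid) != len(direction):
        raise ValueError("resid and direction must have the same length")
    return [r + strength * d for r, d in zip(resid, direction)]


# Toy data: a real residual stream is far wider than 4 dimensions,
# and the real steering vector is not reproduced on this page.
resid = [0.5, -1.0, 2.0, 0.0]
direction = [1.0, 0.0, 0.0, -1.0]  # hypothetical direction

print(hook_layer(STEERING_HOOK))  # 20
print(steer(resid, direction))    # [54.25, -1.0, 2.0, -53.75]
```

With a strength as large as 53.75, the steered components dominate the toy residual values, which is why steering strength is usually tuned per vector.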

Explanations

words related to programming terminology involving strings, numbers, and data structures

oai_token-act-pair · gpt-4o-mini

Configuration

pyvene/gemma-reft-r1-9b-it-res/l20

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

 ویکی‌پدی

-0.66

)':

-0.60

]]:

-0.59

})$}

-0.59

 }}$}

-0.58

"]))

-0.57

"]]

-0.56

)}-

-0.55

")"

-0.55

"}},

-0.55

Positive Logits

 betweenstory

0.49

 Infór

0.46

 gedrag

0.45

MessageTagHelper

0.44

tagHelper

0.44

 desnuda

0.43

Története

0.43

 skolan

0.42

őzés

0.41

transQ

0.41

Activations Density 0.004%
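To put the density figure in perspective, the short calculation below converts it into an approximate token count. It is a rough sketch that assumes the density was measured over the dashboard prompts listed earlier (24,576 prompts of 128 tokens each); the page does not confirm this, and the displayed percentage is rounded.

```python
# Values copied from the page.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.004  # rounded as displayed

# ASSUMPTION: density = activating tokens / all dashboard tokens.
total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT
approx_active = total_tokens * DENSITY_PERCENT / 100

print(total_tokens)          # 3145728
print(round(approx_active))  # 126
```

Under that assumption, only on the order of a hundred tokens out of roughly three million would activate.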

No Known Activations