Explanations

terms related to patient safety and healthcare risk management

oai_token-act-pair · gpt-4o-mini Triggered by @bot

Configuration

ctigges/pythia-70m-deduped__res-sm_processed/4-res-sm

Prompts (Dashboard)

32,768 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

32,768

Data Type

torch.float32

Hook Name

blocks.4.hook_resid_post

Hook Layer

4

Architecture

standard

Context Size

128

Dataset

EleutherAI/the_pile_deduplicated

Activation Function

relu
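
The configuration above describes a sparse autoencoder with the "standard" architecture and a ReLU activation, reading the residual stream at `blocks.4.hook_resid_post`. As a rough illustration of what such an encoder computes, here is a minimal pure-Python sketch. It assumes the common formulation `relu((x - b_dec) @ W_enc + b_enc)`. The names `sae_encode`, `W_enc`, `b_enc`, and `b_dec` are hypothetical, and the toy dimensions (2 inputs, 3 features) stand in for the real sizes (32,768 features). This page does not confirm the exact formula used.

```python
def relu(values):
    """Element-wise ReLU: negative pre-activations become 0.0."""
    return [max(0.0, v) for v in values]


def sae_encode(x, W_enc, b_enc, b_dec):
    """Encode one residual-stream vector into feature activations.

    Assumed formulation (not confirmed by the source page):
        acts = relu((x - b_dec) @ W_enc + b_enc)

    x      : list of length d_model
    W_enc  : d_model rows, each a list of n_features weights
    b_enc  : list of length n_features
    b_dec  : list of length d_model
    """
    centered = [xi - bi for xi, bi in zip(x, b_dec)]
    pre_acts = [
        sum(centered[i] * W_enc[i][j] for i in range(len(centered))) + b_enc[j]
        for j in range(len(b_enc))
    ]
    return relu(pre_acts)


# Toy example: d_model = 2, n_features = 3 (illustrative values only).
x = [1.0, -2.0]
W_enc = [
    [1.0, 0.0, -1.0],
    [0.0, 1.0, 1.0],
]
b_enc = [0.0, 0.5, 0.0]
b_dec = [0.0, 0.0]

acts = sae_encode(x, W_enc, b_enc, b_dec)
print(acts)  # [1.0, 0.0, 0.0]
```

Only the first toy feature has a positive pre-activation, so it is the only one that fires. This is the sense in which a ReLU encoder yields sparse activations.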

Negative Logits

 defence

-2.35

 offence

-2.32

 flavour

-2.05

 colour

-2.00

 colours

-1.91

 Defence

-1.79

 humour

-1.78

 mould

-1.76

 organisation

-1.75

’.

-1.65

Positive Logits

ľĵ

3.87

Ī

3.84

ī

3.84

į

3.83

ĥ½

3.83

↵        ↵

3.82

↵↵

3.82

↵

3.82

↵↵

3.82

Activations Density 2.890%
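
Activation density is usually read as the fraction of tokens on which the feature fires. The sketch below shows that calculation on a toy list of activations. It assumes "fires" means a strictly positive activation. The function name `activation_density` and the sample values are hypothetical, and the page does not say which token sample the 2.890% figure was computed over.

```python
def activation_density(activations):
    """Fraction of tokens whose activation is strictly positive.

    Assumption: a feature "fires" on a token when its activation is > 0.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > 0)
    return fired / len(activations)


# Toy per-token activations for a single feature (illustrative values only).
toy_activations = [0.0, 0.0, 1.2, 0.0, 0.3, 0.0, 0.0, 0.0]

density = activation_density(toy_activations)
print(f"{density:.3%}")  # 25.000%
```

Under the same reading, a density of 2.890% would mean the feature is active on roughly 29 of every 1,000 tokens examined.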
