INDEX

Explanations

distracted driving

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

elementProp

0.58

驅

0.57

 driven

0.55

 stagn

0.54

 Bilbao

0.52

 drive

0.52

 estran

0.52

夭

0.52

 Income

0.51

 Driven

0.51

POSITIVE LOGITS

 distracted

1.41

 distraction

1.24

 distractions

1.22

 drows

1.18

 distract

1.11

 fatigued

1.05

 careless

1.04

 intoxicated

1.04

 drowsiness

1.04

 distra

1.02

Activations Density 0.143%