INDEX

Explanations

the word 'not' and the word 'first'

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

featureID

-0.63

ViewFeatures

-0.59

 Walkover

-0.56

ionales

-0.56

 dipende

-0.56

 udaler

-0.54

WriteAttribute

-0.54

Dunn

-0.54

 couvert

-0.53

etchup

-0.53

POSITIVE LOGITS

nth

0.85

ConstraintMaker

0.66

ChildScrollView

0.64

__(/*!

0.63

prost

0.60

matchCondition

0.58

...”

0.57

!”

0.56

!”,

0.56

”

0.56

Activations Density 0.003%