INDEX

Explanations

oxidase

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Sanctions

-0.60

 sanctions

-0.57

 certifications

-0.56

UpInside

-0.54

 Certifications

-0.54

 aDecoder

-0.52

AndEndTag

-0.51

 oxidase

-0.50

certification

-0.50

 kated

-0.49

POSITIVE LOGITS

<bos>

0.64

 ProtoMessage

0.58

Контак

0.57

 Numerade

0.56

ably

0.56

ViewFeatures

0.54

gnąć

0.54

enschappelijke

0.53

roën

0.53

дца

0.52

Activations Density 0.039%