INDEX

Explanations

starts with Q:

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 låter

-2.03

 impecable

-1.89

 fortsätter

-1.80

on

-1.79

ीक

-1.77

of

-1.77

 behövs

-1.74

ata

-1.73

ak

-1.69

 problemet

-1.67

POSITIVE LOGITS

 klient

1.89

 triko

1.80

 most

1.77

chables

1.77

all

1.77

 prestigious

1.73

Lastly

1.73

ALL

1.68

 more

1.67

 February

1.66

Activations Density 0.002%