INDEX

Explanations

immune

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 round

-0.92

 dietro

-0.80

Round

-0.80

 behind

-0.78

 Sociales

-0.77

曖昧さ回避

-0.76

ագրություններ

-0.75

 ddelweddau

-0.75

 otomatig

-0.74

 Behind

-0.74

POSITIVE LOGITS

 status

0.65

 sensitivity

0.56

ze

0.53

pro

0.51

 construction

0.50

↵↵

0.49

 state

0.49

 chinh

0.48

len

0.48

MigrationBuilder

0.48

Activations Density 0.286%