INDEX

Explanations

form input and label generation

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Adrien

-0.91

恂

-0.91

ivity

-0.91

 potreb

-0.90

Warsaw

-0.85

 anvä

-0.84

setCellStyle

-0.84

ἔ

-0.83

 darbu

-0.81

ذر

-0.80

POSITIVE LOGITS

Paul

0.93

 Hidden

0.88

 衣服

0.85

 있으며

0.85

Hidden

0.85

 bát

0.84

ることができる

0.83

 płasz

0.82

 said

0.81

既然

0.81

Activations Density 0.009%