INDEX

Explanations

research/technical documents

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 AssemblyCompany

-0.73

 transfieras

-0.69

 Roskov

-0.68



-0.64

 vanta

-0.61

 converges

-0.60

αρα

-0.59

ंदीखरीदारी

-0.57

mbolos

-0.56

Weiterlesen

-0.56

POSITIVE LOGITS

riwal

0.55

apnews

0.54

存于互联网档案馆

0.53

0.52

phosa

0.50

avax

0.50

wand

0.48

 otomatig

0.48

lactin

0.48

Sur

0.48

Activations Density 0.003%