Explanations

information and concepts

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

" taill"  0.68
" מספר"  0.61
"た"  0.59
" feuille"  0.59
" anciennes"  0.59
"た"  0.59
" क्रमांक"  0.58
"ECB"  0.57
" található"  0.57
"तै"  0.57

Positive Logits

"送"  0.55
"គ្នា"  0.53
"няются"  0.53
" safer"  0.52
"自信"  0.52
" censoring"  0.52
" bracing"  0.51
"信念"  0.51
"లో"  0.50
"意義"  0.50

Activations Density 0.000%