INDEX

Explanations

learning swimming in system

jokes and factual statements

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

؛

0.51

 hôtels

0.51

 فِي

0.49

會

0.49

岕

0.49

 senhores

0.48

idk

0.48

劉

0.48

 المملكة

0.48

 equipments

0.47

POSITIVE LOGITS

are

0.60

can

0.57

 अन्य

0.56

two

0.55

 інші

0.54

 will

0.54

𝑏

0.53

 तीन

0.51

 অন্যান্য

0.50

 અન્ય

0.50

Activations Density 2.515%