INDEX

Explanations

disintegration

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

RegressionTest

-0.60

Hentet

-0.58

 Ooster

-0.56

 betweenstory

-0.55

 Filling

-0.55

 Fill

-0.53

 يتيمه

-0.52

 filling

-0.50

-0.49



-0.49

POSITIVE LOGITS

 dissolve

0.68

unc

0.66

 thin

0.65

 break

0.65

 loosen

0.64

 clear

0.60

dis

0.59

 liqu

0.59

 Dissolve

0.59

 melt

0.57

Activations Density 0.000%