INDEX

Explanations

offensive terms

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 molec

-0.30

 cocci

-0.28

strstr

-0.27

çŁ¿çī©è´¨

-0.26

 cryst

-0.24

 createContext

-0.23

Tumblr

-0.22

æģ

-0.22

chez

-0.22

 metic

-0.22

POSITIVE LOGITS

 misunder

0.39

 disadv

0.34

 rapes

0.31

"struct

0.30

 bigot

0.29

 Erectile

0.29

 raping

0.29

ä¹łè¿ĳå¹³æĸ°

0.28

 sodom

0.28

 èĩªåĬ¨çĶŁæĪĲ

0.28

Activations Density 6.318%