INDEX

Explanations

evidence

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

iÄŁi

-0.23

*pow

-0.22

unprocessable

-0.22

å²Ļ

-0.22

çļĦè¯Ŀé¢ĺ

-0.21

*Math

-0.21

åĹĮ

-0.21

èı¹

-0.20

åıĺåİĭ

-0.20

çļĦåħ³æ³¨

-0.19

POSITIVE LOGITS

è¯ģæį®

0.47

 evidence

0.47

è¯ģå®ŀ

0.36

ç»ĵè®º

0.35

 Evidence

0.34

 supporting

0.32

è¯ģæĺİ

0.32

åı¯ä¿¡

0.31

 facts

0.29

è¯ģ

0.29

Activations Density 2.242%