INDEX

Explanations

mixed code/documentation

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

PlotsExplanationShow Test FieldDefault Test Text

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 separately

-0.76

 independently

-0.71

 manually

-0.67

 peacefully

-0.65

 individually

-0.64

 orally

-0.64

 verbally

-0.60

 efficiently

-0.60

 sequentially

-0.59

 dynamically

-0.59

POSITIVE LOGITS

éļĲèĹı

0.29

rys

0.26

ä¸įçķĻ

0.25

çĻ¾å§ĵ

0.25

ç®¡çĲĨæ¨¡å¼ı

0.25

åľ»

0.25

æľĢåĲİä¸Ģ

0.25

åŃĺåľ¨äºİ

0.24

è¶ĭåĲĳ

0.24

æĹ¶ä»£ä¸ŃåĽ½

0.24

Activations Density 0.907%