INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

1.83

1.53

ure

1.29

?,

1.25

 היא

1.25

_,

1.19

ur

1.18

pd

1.18

furt

1.17

 就是

1.16

POSITIVE LOGITS

सिला

1.57

 contralateral

1.51

িকারী

1.44

 philosophical

1.37

叐

1.37

 behavioral

1.37

 হইয়াছিলেন

1.35

 করিয়

1.35

ઠવા

1.33

anlı

1.31

Activations Density 0.004%

No Known Activations