INDEX

Explanations

describing how something is done

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

鋱

-2.75

﹆

-2.73

aktionen

-2.61

 alojamientos

-2.59

dáms

-2.59

 ruinas

-2.59

 みた

-2.53

tentu

-2.45

 profondo

-2.42

熺

-2.42

POSITIVE LOGITS

<td>

3.31

3.30

This

2.64

ка

2.55

<em>

2.53

 Many

2.52

re

2.42

是被

2.39

 From

2.36

on

2.30

Activations Density 0.004%