INDEX

Explanations

introducing something

response starters like here or okay

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

 \%)$

0.31

ებისთვის

0.31

 شوند

0.31

っちゃ

0.31

)」

0.29

_)

0.29

 detract

0.29

 deberían

0.29

ྥ

0.29

POSITIVE LOGITS

<h1>

0.79

When

0.79

<h4>

0.78

There

0.77

<h3>

0.76

<h2>

0.75

This

0.75

<blockquote>

0.73

While

0.72

<h5>

0.71

Activations Density 1.822%