INDEX

Explanations

expressions that indicate a need for awareness or consideration of differences

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

llib          -0.09
/fw           -0.08
alette        -0.08
bai           -0.07
endcode       -0.07
@js           -0.07
HeaderCode    -0.07
urette        -0.07
ogh           -0.07
latent        -0.07

Positive Logits

 even         0.06
anda          0.06
ongo          0.06
/*            0.06
 whatever     0.06
--            0.06
/*            0.06
 insign       0.05
 perhaps      0.05

Activations Density 0.000%