INDEX

Explanations

terms related to loss, injury, and consequences in various contexts

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

 itself

-0.10

 яке

-0.08

所有

-0.07

apus

-0.07

bih

-0.07

.SDK

-0.07

hangi

-0.07

rame

-0.07

omor

-0.07

elix

-0.07

Positive Logits

or

0.14

 either

0.12

 eller

0.10

 hoặc

0.09

 или

0.09

 throughout

0.09

或

0.09

 themselves

0.09

 oder

0.09

或者

0.09

Activations Density 0.058%
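
As a rough sanity check on what this density means in absolute terms, the sketch below multiplies out the prompt configuration listed above. It assumes that the density is the share of token positions on which the feature has a nonzero activation, and that it was measured over the same 24,576 prompts of 128 tokens each. Neither assumption is stated on this page, and the function names are illustrative only.

```python
# Values copied from the dashboard configuration above.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.058


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions across all prompts."""
    return num_prompts * tokens_per_prompt


def estimated_active_tokens(total: int, density_percent: float) -> float:
    """Estimated count of token positions where the feature fires.

    Assumption: density is the percentage of token positions with a
    nonzero activation, measured over the same prompt set.
    """
    return total * density_percent / 100.0


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = estimated_active_tokens(total, DENSITY_PERCENT)

print(f"total token positions: {total:,}")
print(f"estimated active positions: {active:,.0f}")
```

Under these assumptions the feature fires on roughly 1,800 of about 3.1 million token positions, which is why it counts as sparse.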