INDEX

Explanations

actions related to physical altercations and confrontations

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

entes

-0.08

ecta

-0.07

AuthProvider

-0.07

reon

-0.07

mlin

-0.07

aidu

-0.06

ÑĢÑĥ

-0.06

rijk

-0.06

iras

-0.06

endon

-0.06

POSITIVE LOGITS

 struggle

0.07

 Ingram

0.06

 struggling

0.06

anel

0.06

 struggles

0.06

hy

0.06

 Dickens

0.06

 Daniel

0.06

 struggled

0.06

ocos

0.06

Activations Density 0.007%