INDEX

Explanations

references to violations of laws or rules

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

.scalablytyped

-0.09

Î»Î¹

-0.09

utory

-0.08

arde

-0.08

onde

-0.08

æŀĿ

-0.07

onga

-0.07

czy

-0.07

.initializeApp

-0.07

ellite

-0.07

POSITIVE LOGITS

 rules

0.11

 laws

0.10

 norms

0.09

 orders

0.08

by

0.08

 principles

0.07

 regulations

0.07

rules

0.07

 expectations

0.07

 bounds

0.07

Activations Density 0.020%