INDEX

Explanations

terms related to animal control and responsibility

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

deniz

-0.08

ï¸

-0.08

sky

-0.08

ddit

-0.07

Ø¬Ø§

-0.07

Ã³i

-0.07

ampo

-0.06

mile

-0.06

ronic

-0.06

.jsp

-0.06

POSITIVE LOGITS

reads

0.06

 breed

0.06

VERTISE

0.06

 mediation

0.06

 Breed

0.06

 mediator

0.05

 Lust

0.05

ĥ

0.05

PLAN

0.05

Activations Density 0.004%