INDEX

Explanations

instances of violent actions and related physical attributes

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

Token                   Logit
"esteem"                -0.07
"arta"                  -0.07
"вищ"                   -0.06
"arnation"              -0.06
" hypnot"               -0.06
"commons"               -0.06
"stry"                  -0.06
".getElementsByName"    -0.06
"asca"                  -0.06
"schema"                -0.06

Positive Logits

Token                   Logit
"是在"                  0.10
" wherever"             0.09
" anywhere"             0.09
" location"             0.08
"地点"                  0.08
" tại"                  0.08
" near"                 0.08
"Anywhere"              0.07
"location"              0.07
" Outside"              0.07

Activations Density 0.114%
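
To put the configuration and density figures in proportion, the short Python sketch below multiplies out the prompt count and prompt length, then applies the density percentage. It assumes the 0.114% density was measured over exactly these 24,576 prompts of 128 tokens, which the page does not state. The function names are illustrative and not part of any dashboard API.

```python
# Figures copied from the dashboard; everything else is an illustrative assumption.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.114


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions across all prompts."""
    return num_prompts * tokens_per_prompt


def estimated_active_tokens(total: int, density_percent: float) -> int:
    """Approximate count of positions where the feature fires.

    Assumes the density is a percentage of the same token positions
    counted in `total` (not confirmed by the dashboard).
    """
    return round(total * density_percent / 100)


if __name__ == "__main__":
    total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
    active = estimated_active_tokens(total, ACTIVATION_DENSITY_PERCENT)
    print(f"total token positions: {total:,}")
    print(f"estimated active positions: {active:,}")
```

Under that assumption, the feature would fire on roughly 3,600 of about 3.1 million token positions.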