INDEX

Explanations

references to attacks and perpetrators in various contexts

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

onta

-0.07

 Mounted

-0.07

hift

-0.07

ishop

-0.07

enton

-0.06

ellas

-0.06

oons

-0.06

 wishes

-0.06

piel

-0.06

abyrin

-0.06

POSITIVE LOGITS

 organizer

0.07

 planning

0.07

 aborted

0.07

addtogroup

0.06

Planning

0.06

 Abort

0.06

 operatives

0.06

 plotting

0.06

 abort

0.06

odor

0.06

Activations Density 0.003%