Explanations

elements related to investigations and reports of threats or actions taken

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): cerebras/SlimPajama-627B

Negative Logits

Token      Logit
"ut"       -0.06
"ÑĮ"       -0.06
"ÂŃt"      -0.06
"af"       -0.06
" ÂŃ"      -0.06
"raw"      -0.06
"ugg"      -0.06
"Ð·"       -0.06
"lef"      -0.06
" bite"    -0.05

Positive Logits

Token            Logit
"istrovstvÃŃ"    0.07
"ebi"            0.06
"ombat"          0.06
"anja"           0.06
"prs"            0.06
"oya"            0.06
"azzo"           0.06
"Ð²Ð°Ð¶"         0.06
"ept"            0.06
"bast"           0.06
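
Several tokens in the logit lists above (for example "Ð·" or "istrovstvÃŃ") look like byte-level BPE vocabulary strings rather than readable text. The page does not say which tokenizer the model uses, so the sketch below rests on an assumption: that the tokens are shown in the GPT-2 style byte-to-character alphabet. The helper names `byte_level_alphabet` and `decode_token` are hypothetical and exist only for this illustration.

```python
def byte_level_alphabet():
    """Build a char -> byte table for the GPT-2 style byte-level alphabet.

    ASSUMPTION: the dashboard shows tokens in this alphabet; the source
    page does not confirm which tokenizer is in use.
    """
    # Bytes that are displayed as themselves (printable ASCII and Latin-1).
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAD))
        + list(range(0xAE, 0x100))
    )
    byte_values = printable[:]
    code_points = printable[:]
    offset = 0
    # Every remaining byte is shifted to a code point starting at U+0100.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return {chr(c): b for b, c in zip(byte_values, code_points)}


CHAR_TO_BYTE = byte_level_alphabet()


def decode_token(displayed):
    """Map a displayed token string back to the text it encodes."""
    raw = bytearray()
    for ch in displayed:
        if ch in CHAR_TO_BYTE:
            raw.append(CHAR_TO_BYTE[ch])
        else:
            # Characters outside the alphabet (such as a literal leading
            # space rendered by the page) are passed through unchanged.
            raw.extend(ch.encode("utf-8"))
    # Partial UTF-8 sequences become U+FFFD instead of raising an error.
    return raw.decode("utf-8", errors="replace")


print(decode_token("Ð·"))           # з
print(decode_token("Ð²Ð°Ð¶"))       # важ
print(decode_token("istrovstvÃŃ"))  # istrovství
```

Under this assumption, "ÂŃ" decodes to a soft hyphen (U+00AD), which is invisible when printed; comparing code points is more reliable than reading the output by eye for such tokens.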

Activations Density 0.015%
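
As a rough sanity check, the density figure can be combined with the dashboard configuration to estimate how many token positions the feature fires on. This is only a sketch under two assumptions not stated on the page: that density means the fraction of token positions with a nonzero activation, and that it was measured over the same 24,576 prompts of 128 tokens each. The function names are hypothetical.

```python
# Figures copied from the dashboard; everything else is illustrative.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.015


def total_tokens(num_prompts, tokens_per_prompt):
    """Total number of token positions in the dashboard prompt set."""
    return num_prompts * tokens_per_prompt


def estimated_active_tokens(num_prompts, tokens_per_prompt, density_percent):
    """Estimate how many positions activate the feature.

    ASSUMPTION: density is the share of token positions with nonzero
    activation, measured over this same prompt set.
    """
    return total_tokens(num_prompts, tokens_per_prompt) * density_percent / 100


print(total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT))  # 3145728
print(round(estimated_active_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT, DENSITY_PERCENT)))  # 472
```

If those assumptions hold, the feature is active on roughly 470 of about 3.1 million token positions, which fits the picture of a sparse feature.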