INDEX

Explanations

concepts related to defense and protection

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

adow

-0.06

 operator

-0.06

imo

-0.06

 cond

-0.06

ì§¸

-0.06

Â¯ÃĤ

-0.06

Ð»ÑĥÐ³

-0.06

rame

-0.06

eren

-0.06

>=

-0.06

POSITIVE LOGITS

\Doctrine

0.07

éĻµ

0.06

 Crossing

0.06

iman

0.06

amsung

0.06

irebase

0.06

 Ø§Ø·ÙĦ

0.06

 mÃ¡

0.06

Ø±ÙĪØ²

0.06

à¤¾à¤¨à¤¨

0.06

Activations Density 0.027%