INDEX

Explanations

phrases or terms related to anti-Semitism and related ideological assertions

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Straight

-0.07

Ð´Ð°Ð¿

-0.06

ymph

-0.06

Pil

-0.06

inel

-0.06

osph

-0.06

 Plain

-0.06

itsu

-0.06

 Planned

-0.06

Ð¼ÐµÐ»ÑĮ

-0.06

POSITIVE LOGITS

awy

0.07

alla

0.07

ahy

0.07

icit

0.07

UCKET

0.07

EMENT

0.07

Ø³ØªÙĩ

0.07

wert

0.06

ãĥĲãĥ¼

0.06

wc

0.06

Activations Density 0.000%