INDEX

Explanations

references to military personnel and their involvement in human rights violations

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

benh

-0.08

arshal

-0.08

 annunci

-0.08

Ð»Ð°ÑĤÐ¸

-0.08

.Atomic

-0.08

 âĹĦ

-0.08

Ä±ÅŁÄ±k

-0.07

 Annunci

-0.07

 zatÃŃm

-0.07

xfa

-0.07

POSITIVE LOGITS

 during

0.07

 deeds

0.07

om

0.06

 allegedly

0.06

SS

0.06

GB

0.06

ott

0.06

gen

0.05

 activities

0.05

Wil

0.05

Activations Density 0.002%