INDEX

Explanations

references to injury or trauma-related terms

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

avor

-0.08

ushima

-0.07

.ht

-0.07

iesel

-0.07

 preferredStyle

-0.07

azo

-0.06

Ð°ÐºÐ¾Ð½

-0.06

adies

-0.06

ãģ¤ãģ¶

-0.06

POSITIVE LOGITS

Ã³j

0.06

 Fach

0.06

on

0.06

DDS

0.05

ilerek

0.05

Soy

0.05

 è²

0.05

 Hoff

0.05

bel

0.05

 removal

0.05

Activations Density 0.098%