Explanations

references to violence and environmental disasters

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

"pii"      -0.07
"ätz"      -0.07
"uffers"   -0.07
"gnore"    -0.07
"fik"      -0.07
"oyo"      -0.07
"众"       -0.07
" улыб"    -0.06
"/place"   -0.06
"azed"     -0.06

Positive Logits

" report"    0.07
"AGAIN"      0.06
" alarm"     0.06
"lament"     0.06
" worry"     0.06
"yh"         0.06
" Alarm"     0.06
" compared"  0.06
" Report"    0.06
" reports"   0.06

Activations Density 0.023%