words that refer to risks or challenges

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 ëı

-0.07

reau

-0.07

aqu

-0.06

cke

-0.06

 Latina

-0.06

 handwriting

-0.06

whose

-0.06

inite

-0.06

ular

-0.06

ushi

-0.06

POSITIVE LOGITS

swer

0.08

 ØªÙĪØ³Ø·

0.08

by

0.07

 oleh

0.07

Ã¢l

0.06

yon

0.06

orny

0.06

Ø¯Ø±Ø³

0.06

Activations Density 0.016%