INDEX

Explanations

expressions related to participation and involvement in various activities

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 pornos

-0.10

âĪı

-0.09

abwe

-0.09

 Ð¾Ð³ÑĢÐ°

-0.09

Å¯l

-0.09

Ð³Ð°Ð»Ñĸ

-0.09

_contrib

-0.09

erus

-0.09

vise

-0.08

erva

-0.08

POSITIVE LOGITS

id

0.07

 oneself

0.06

and

0.06

ment

0.06

SD

0.06

aling

0.06

illing

0.06

itude

0.06

Activations Density 0.127%