INDEX

Explanations

words related to various types of "-ism" concepts or tendencies

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

zilla

-0.08

iÄįka

-0.07

ÑĩÐ¸ÑĤ

-0.07

chap

-0.07

inine

-0.07

 geil

-0.07

tober

-0.07

anson

-0.07

Ã½t

-0.07

amination

-0.06

POSITIVE LOGITS

keit

0.08

oux

0.07

dale

0.07

emma

0.06

otle

0.06

otent

0.06

opher

0.06

ear

0.06

dling

0.06

 myself

0.06

Activations Density 0.034%