INDEX

Explanations

content

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

UnusedPrivate

-0.89

 виправивши

-0.87

 متعلقه

-0.85

Efq

-0.81

AutoScaleMode

-0.81

 betweenstory

-0.80

 surla

-0.78

RegressionTest

-0.77

 الاطلاع

-0.74

 تانيه

-0.73

POSITIVE LOGITS

 management

0.52

 bowl

0.46

िन्न

0.45

 جلو

0.45

 Management

0.44

gé

0.44

Dog

0.43

窝

0.43

 generator

0.43

 Jail

0.43

Activations Density 0.013%