INDEX

Explanations

person names and scores

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 ruim

-0.81

arrón

-0.78

polizei

-0.77

zij

-0.77

digt

-0.76

eras

-0.75

atikan

-0.73

 Historical

-0.73

ăng

-0.72

quila

-0.72

POSITIVE LOGITS

 adapted

0.94

 string

0.88

hese

0.77

 estimated

0.77

 estudar

0.76

 estudia

0.76

 updateUser

0.75

 adapting

0.75

 senseless

0.74

 cờ

0.73

Activations Density 0.000%