Explanations

technical documentation citing specific terms

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

" sekä"       -1.03
" walaupun"   -0.96
" provocó"    -0.92
" kerana"     -0.90
"鮭"          -0.88
"出声"        -0.86
" llorando"   -0.86
"ങ്"          -0.85
"����"        -0.85
"��������"    -0.83

Positive Logits

"是你"         0.91
"Keine"        0.90
"엥"           0.90
"うえ"         0.89
" होना"        0.89
"ις"           0.88
" áng"         0.85
"Dni"          0.85
"たびに"       0.85
"コンピュー"   0.84

Activations Density 0.000%
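
To put the density figure in context, the short sketch below works out how many tokens the dashboard prompts cover and how few activating tokens would still display as 0.000%. It rests on two assumptions the page does not confirm: that density is the percentage of dashboard tokens on which the feature is nonzero, and that the value is rounded to three decimal places. The function names are illustrative only.

```python
# Figures taken from the "Prompts (Dashboard)" entry above.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions covered by the dashboard prompts."""
    return num_prompts * tokens_per_prompt


def density_percent(active_tokens: int, total: int) -> float:
    """Assumed definition: percentage of tokens with a nonzero activation."""
    return 100.0 * active_tokens / total


def max_active_shown_as_zero(total: int, decimals: int = 3) -> int:
    """Largest activating-token count that still displays as 0.000...%."""
    zero = f"{0.0:.{decimals}f}"
    count = 0
    # Step up until the next count would no longer format as zero.
    while f"{density_percent(count + 1, total):.{decimals}f}" == zero:
        count += 1
    return count


if __name__ == "__main__":
    total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
    print(f"total tokens: {total:,}")
    print(f"max activating tokens shown as 0.000%: {max_active_shown_as_zero(total)}")
```

Under those assumptions, a displayed density of 0.000% does not mean the feature never fires, only that it fires on a very small number of the sampled tokens.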