INDEX

Explanations

familiarity with something

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

</i>

-1.80

↵

-1.75

繝

-1.68

 だっ

-1.64

to

-1.64

 diğer

-1.63

してた

-1.58

 econó

-1.56

蹰

-1.52

堕

-1.49

POSITIVE LOGITS

⩥

1.91

的一個

1.82

 какое

1.77

();

1.77

 Füßen

1.74

槩

1.74

纐

1.68

尔夫

1.62

玙

1.61

fang

1.59

Activations Density 0.010%