INDEX

Explanations

the followed by a specific noun

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

があるので

-0.90

 IMMEDIATE

-0.86

вшим

-0.85

只不过

-0.84

 impecable

-0.82

 BEAUT

-0.81

ĥ

-0.80

 aiment

-0.79

hlten

-0.79

 clark

-0.78

POSITIVE LOGITS

 their

1.02

 huge

1.00

 thrilling

0.97

 actions

0.96

 existing

0.94

 fierce

0.91

 evaluation

0.91

new

0.91

 benefits

0.90

 various

0.90

Activations Density 0.011%