INDEX

Explanations

explains

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 explains

-0.94

 Explains

-0.85

 مشين

-0.80

 برانيه

-0.77

]**

-0.74

AnimationsModule

-0.72

 nahilalakip

-0.71

Bakgrunnsstoff

-0.71

 חיצוניים

-0.71

 explique

-0.69

POSITIVE LOGITS

 what

0.79

how

0.75

 that

0.70

0.66

the

0.64

by

0.61

 where

0.60

and

0.57

it

0.56

Activations Density 0.742%