INDEX

Explanations

sure

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 sure

-2.53

sure

-2.11

Sure

-1.78

 SURE

-1.77

 Sure

-1.71

certain

-1.59

 CERTAIN

-1.53

 certain

-1.52

 surely

-1.52

 Certain

-1.51

POSITIVE LOGITS

<bos>

0.62

 that

0.59

 impressions

0.56

to

0.54

 classes

0.54

 types

0.54

‘

0.53

 بيها

0.53

“

0.53

 locations

0.53

Activations Density 0.256%