INDEX

Explanations

General English phrases

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

TagMode	-0.86
afficheront	-0.78
webElementXpaths	-0.78
 <>",	-0.75
AndEndTag	-0.73
contentLoaded	-0.73
 Roskov	-0.72
devamını	-0.71
 بيها	-0.69
 myſelf	-0.68

Positive Logits

 fact	0.76
 Facts	0.69
 facts	0.66
Facts	0.66
Fact	0.63
 Fact	0.61
 FACTS	0.55
facts	0.54
 FACT	0.54
 hecho	0.54

Activations Density 0.002%
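
To put the density figure in context, the short sketch below works through the arithmetic using only the numbers shown on this page: 16,384 prompts of 128 tokens each, and an activation density of 0.002%. It assumes, without confirmation from the dashboard, that the density was measured over this same prompt set; the constant and function names are illustrative only.

```python
# Figures copied from the dashboard configuration on this page.
NUM_PROMPTS = 16_384
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.002  # "Activations Density 0.002%"


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions in the dashboard prompt set."""
    return num_prompts * tokens_per_prompt


def expected_active_tokens(total: int, density_percent: float) -> float:
    """Approximate count of token positions where the feature fires.

    Assumption: the density is a percentage of the same token
    positions counted by total_tokens().
    """
    return total * density_percent / 100.0


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = expected_active_tokens(total, ACTIVATION_DENSITY_PERCENT)

print(f"total tokens: {total:,}")
print(f"approx. active tokens: {active:.0f}")
```

Under that assumption the feature would fire on roughly 42 of about 2.1 million token positions, which fits a feature that responds narrowly to variants of "fact".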