INDEX

Explanations

mathematical notation

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Diweddarwch

-0.81

mektedir

-0.70

 nakalista

-0.69

ViewFeatures

-0.68

RepeatedField

-0.66

 Kraus

-0.66

Carcinogenicity

-0.65

 Cougars

-0.65

SharedDtor

-0.65

 Lesley

-0.64

POSITIVE LOGITS

}}_

1.37

}}_{\

0.87

 prisonniers

0.81

 nemici

0.80

 ennemis

0.77

 blessés

0.77

 Beller

0.76

 Divider

0.76

 normaux

0.75

 supérieurs

0.75

Activations Density 0.009%