INDEX

Explanations

expressions of causality

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

");

-1.79

);

-1.78

is

-1.63

 acompañada

-1.61

 acompañado

-1.52

);

-1.41

'];

-1.39

 extremadamente

-1.39

👎

-1.38

ttps

-1.38

POSITIVE LOGITS

the

1.81

 maintenant

1.77

 depuis

1.70

 jetzt

1.70

our

1.58

liebte

1.55

 ahora

1.52

 sekarang

1.52

好吧

1.51

憷

1.48

Activations Density 0.014%