INDEX

Explanations

effort and best undertakings

The neuron spotlights hedge/disclaimer language—phrases expressing efforts or commitments (e.g. “we make every effort,” “we strive,” “our best to ensure”).

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

-1.55

くれました

-1.52

もっと

-1.51

 secciones

-1.45

 specifically

-1.43

 bahía

-1.37

now

-1.37

</strong>

-1.36

</h1>

-1.33

 tradiciones

-1.31

POSITIVE LOGITS

to

1.70

at

1.56

ISTRATION

1.26

ātu

1.25

All

1.25

While

1.23

1.21

just

1.19

Three

1.19

one

1.18

Activations Density 0.022%