INDEX

Explanations

adverbs and discourse markers

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

kuuta

-0.53

are

-0.52

 starve

-0.45

 visit

-0.43

 nale

-0.43

TRIBUT

-0.42

 relate

-0.42

 converse

-0.42

 imitate

-0.42

 wieś

-0.42

POSITIVE LOGITS

 wants

0.94

 publishes

0.92

 sells

0.88

 owns

0.88

 recognizes

0.86

 uses

0.85

 considers

0.82

 defines

0.82

 chooses

0.82

 refuses

0.81

Activations Density 0.018%