INDEX

Explanations

prefix plus suffix forming words

prefix indicating negation or negativity

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

 большая

0.30

提供

0.28

锯

0.26

指定

0.26

 فقط

0.26

 ONLY

0.26

 повністю

0.26

除了

0.25

 tanque

0.25

BFF

0.25

POSITIVE LOGITS

 admon

0.26

 impover

0.23

political

0.23

 despot

0.22

 semblable

0.22

 idolat

0.22

{\'

0.21

 tormented

0.21

 uprisings

0.21

ulatory

0.21

Activations Density 0.714%