INDEX

Explanations

decode, decompress, parse, undo

New Auto-Interp

Top Features by Cosine Similarity

Configuration

Prompts (Dashboard)

10,000 prompts, 128 tokens each

Dataset (Dashboard)

lmsys/lmsys-chat-1m

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

encrypt

-0.14

 encrypt

-0.14

_encrypt

-0.13

 Encrypt

-0.13

.encrypt

-0.12

Encrypt

-0.12

.SerializeObject

-0.11

.compress

-0.11

.serialize

-0.10

.Escape

-0.10

POSITIVE LOGITS

è§£

0.24

 decode

0.20

 è§£

0.19

 inverse

0.19

 reverse

0.19

 undo

0.18

Dec

0.18

 decoding

0.17

decode

0.17

undo

0.17

Activations Density 0.081%