INDEX

Explanations

block followed by specific context

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

embar

1.23

푅

1.22

stig

1.18

⎟

1.18

 areal

1.15

𝘧

1.14

𝘨

1.13

pher

1.13

еш

1.12

 Membership

1.12

POSITIVE LOGITS

buster

1.69

quote

1.50

busters

1.44

块

1.43

塊

1.33

塀

1.26

 Qué

1.24

notas

1.22

tober

1.19

chains

1.19

Activations Density 0.064%