INDEX

Explanations

especially particular context

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 verwendeten

0.39

origine

0.38

வீன

0.38

}^{*

0.38

 exigences

0.37

enz

0.36

 ਅਤੇ

0.36

 ಉತ್ಪನ್ನ

0.36

durch

0.35

}$;

0.35

POSITIVE LOGITS

 camo

0.49

！！！！

0.47

！！！

0.43

 magari

0.43

 অনেক

0.42

!!!!

0.41

 congrats

0.40

 whitelist

0.40

ssd

0.40

 everytime

0.40

Activations Density 0.012%