INDEX

Explanations

phrases starting with prepositions

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Farms

0.38

of

0.37

 Operating

0.37

 Sharp

0.37

 affez

0.37

ment

0.36

Від

0.36

Α

0.35

aky

0.34

 internos

0.34

POSITIVE LOGITS

 exfol

0.40

 unmistak

0.40

 obfusc

0.38

 nginx

0.38

ッチン

0.37

쿠

0.37

 neutrophil

0.37

 niacin

0.36

 neurotrans

0.35

 cheating

0.35

Activations Density 0.001%