INDEX

Explanations

chemical, security, physical phenomena

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ūsų

0.49

are

0.45

ăți

0.44

 Küche

0.43

 bụng

0.43

ží

0.43

Stuart

0.43

 Plus

0.43

장은

0.42

齜

0.42

POSITIVE LOGITS

heids

0.47

を持

0.46

 insider

0.45

 embezzlement

0.44

 selfish

0.44

 sclerosis

0.43

 insiders

0.43

 nepot

0.43

 narcotics

0.42

 bribery

0.42

Activations Density 0.002%