INDEX

Explanations

pun intended or literally

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 거죠

0.67

 xác

0.64

 thỏa

0.61

Anything

0.61

 arises

0.61

 explains

0.61

ாரி

0.60

룹

0.58

समझ

0.58

 સમજ

0.57

POSITIVE LOGITS

pun

1.92

pun

1.78

 puns

1.61

Pun

1.59

Pun

1.55

PUN

1.54

 metaphor

1.50

 analogy

1.37

 pardon

1.31

 literally

1.29

Activations Density 0.020%