INDEX

Explanations

superior followed by positive outcomes

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Negative Logits

Token    Logit
ش        1.03
ت        0.95
س        0.87
স        0.84
ق        0.84
ع        0.84
ن        0.82
ج        0.79
ш        0.79
         0.78

Positive Logits

Token             Logit
 whistleblower    0.68
 kõik             0.63
 immobil          0.59
 seeker           0.59
 thermoelectric   0.59
 borrower         0.58
 reimbursement    0.58
 staunch          0.57
 workstation      0.57
 cathode          0.56

Activations Density 0.001%