INDEX

Explanations

divert, distract, divergence, distraction

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

و

2.85

 избежать

2.78

,\,\

2.77

️⃣

2.68

هل

2.64

{$

2.64

তে

2.61

 carg

2.61

 tended

2.60

ो

2.59

POSITIVE LOGITS

াল

2.89

б

2.38

 homen

2.38

 anest

2.33

ते

2.33

ы

2.31

ნ

2.29

</iframe>

2.28

garakan

2.25

ﾝ

2.21

Activations Density 0.081%