INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ї

0.54

seven

0.53

peptide

0.52

PhD

0.50

各项

0.49

Yesterday

0.49

Defendants

0.49

 যাবতীয়

0.48

粑

0.48

POSITIVE LOGITS

 éviter

0.64

 evitare

0.64

 jiné

0.61

 evitar

0.61

 utiliser

0.61

 wisely

0.61

 verwenden

0.58

 refrained

0.58

 avoid

0.57

 siempre

0.56

Activations Density 0.000%

No Known Activations