INDEX

Explanations

code formatting and comments

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

if

0.96

0.93

ė

0.92

↵

0.86

е

0.86

eseorang

0.78

с

0.78

in

0.77

func

0.77

POSITIVE LOGITS

 fearing

0.95

 subjecting

0.89

lara

0.88

 굉장

0.87

և

0.86

 gastritis

0.85

യുടെ

0.84

 unpublished

0.84

 classifying

0.84

 genus

0.83

Activations Density 0.222%