INDEX

Explanations

negative judgment on descriptions

Configuration

Prompts (Dashboard): 238,145 prompts, 512 tokens each

Dataset (Dashboard): lmsys + oasst1
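
As a rough sanity check on the configuration above, the number of token positions the dashboard covers can be estimated from the two figures given. This is only a sketch: it assumes every prompt is exactly 512 tokens long with no padding or truncation, which the page does not confirm. The function name is illustrative and not part of any tool.

```python
# Figures taken from the dashboard configuration.
NUM_PROMPTS = 238_145
TOKENS_PER_PROMPT = 512


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Token positions covered, assuming every prompt has the same length."""
    return num_prompts * tokens_per_prompt


if __name__ == "__main__":
    # Thousands separators make the magnitude easier to read.
    print(f"{total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT):,}")
```

Under that assumption the dashboard prompts span roughly 122 million token positions.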

Negative Logits

" மற்றும்"  0.54
(no token text)  0.53
" మరియు"  0.47
"Ин"  0.46
" EXPER"  0.46
" ಮತ್ತು"  0.44
" சிகிச்ச"  0.44
"т"  0.43
"ி"  0.43
"ت"  0.42

Positive Logits

" immoral"  0.51
" blackmail"  0.49
" الدنيا"  0.49
" multinationals"  0.48
" meagre"  0.48
" ridicule"  0.47
" disprove"  0.47
" malign"  0.47
" scanty"  0.46
" petty"  0.46

Activations Density 0.010%
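
To give the density figure a concrete scale, the sketch below converts it into an approximate count of activating tokens. It rests on two assumptions that the page does not state: that the density is measured over the 238,145 dashboard prompts, and that each prompt is exactly 512 tokens long. The names are illustrative only.

```python
# Figures taken from the dashboard; how they relate to each other is an assumption.
NUM_PROMPTS = 238_145
TOKENS_PER_PROMPT = 512
DENSITY_PERCENT = 0.010  # "Activations Density 0.010%"


def expected_active_tokens(num_prompts: int, tokens_per_prompt: int,
                           density_percent: float) -> float:
    """Approximate number of token positions on which the feature fires."""
    total = num_prompts * tokens_per_prompt
    return total * density_percent / 100.0


if __name__ == "__main__":
    estimate = expected_active_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT, DENSITY_PERCENT)
    print(f"about {estimate:,.0f} activating tokens")
```

If those assumptions hold, the feature fires on about one token in ten thousand, or on the order of twelve thousand tokens across the dashboard prompts.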