INDEX

Explanations

preposition followed by a noun

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 hazards

0.80

 hazard

0.79

 parlano

0.72

蚩

0.70

ty

0.69

 fridge

0.68

tsd

0.68

㘿

0.68

 censored

0.67

 nestled

0.67

POSITIVE LOGITS

 upon

2.93

 Upon

2.83

Upon

2.81

upon

2.72

 waarop

1.89

 върху

1.65

 whereupon

1.57

 asupra

1.57

 üzerine

1.48

 auquel

1.39

Activations Density 0.140%