INDEX

Explanations

differences between consecutive

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

秘书

0.45

Collective

0.44

限于

0.43

Army

0.42

armée

0.42

诓

0.42

 Geral

0.41

 అధికారులు

0.41

 আত্মসমর্পণ

0.41

 Ministério

0.41

POSITIVE LOGITS

 equidistant

0.55

 increment

0.54

 intervals

0.53

 increments

0.53

 diferencias

0.53

 consistently

0.52

 incremental

0.51

差

0.50

 equid

0.50

 differences

0.49

Activations Density 0.065%