Explanations

balance and trade-offs

Configuration

Prompts (Dashboard): 10,000 prompts, 128 tokens each

Dataset (Dashboard): lmsys/lmsys-chat-1m

Negative Logits

Token               Logit
"riad"              -0.11
"mos"               -0.10
"itol"              -0.09
".scalablytyped"    -0.09
"SSIP"              -0.09
"reeze"             -0.09
"kontakte"          -0.09
"_dropout"          -0.08
"buster"            -0.08
"ypse"              -0.08

Positive Logits

Token               Logit
" balance"          0.82
" Balance"          0.68
"balance"           0.65
"Balance"           0.62
" balances"         0.58
" balancing"        0.56
"_balance"          0.49
".balance"          0.47
"alance"            0.45
"bal"               0.43

Activations Density 0.190%
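
To put the density figure in context, here is a rough sketch of how many tokens it would correspond to in the dashboard sample. It assumes that the density is the fraction of tokens on which the feature has a nonzero activation, and that it was measured over the 10,000 prompts of 128 tokens each listed under Configuration. The page itself confirms neither assumption, and the function name is hypothetical.

```python
def estimated_active_tokens(num_prompts: int,
                            tokens_per_prompt: int,
                            density_percent: float) -> int:
    """Estimate how many tokens activate the feature.

    Assumption (not stated on the dashboard): density is the share of
    tokens with a nonzero activation, expressed as a percentage.
    """
    total_tokens = num_prompts * tokens_per_prompt
    return round(total_tokens * density_percent / 100)


# Values taken from the dashboard: 10,000 prompts, 128 tokens each, 0.190%.
total_tokens = 10_000 * 128
active = estimated_active_tokens(10_000, 128, 0.190)
print(f"{active} of {total_tokens} tokens")
```

Under those assumptions the feature would fire on roughly 2,432 of 1,280,000 tokens, which fits a sparse feature tied to a narrow family of "balance" tokens.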