INDEX

Explanations

healthy and youthful

New Auto-Interp

Top Features by Cosine Similarity

Configuration

Prompts (Dashboard)

10,000 prompts, 128 tokens each

Dataset (Dashboard)

lmsys/lmsys-chat-1m

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

OfSize

-0.11

 flourishing

-0.10

FromClass

-0.10

SimpleName

-0.09

 åĸ

-0.09

 ëĦ¤ìĿ´íĬ¸

-0.09

EMPLARY

-0.09

affer

-0.09

 egregious

-0.09

POSITIVE LOGITS

 healthy

0.26

 happy

0.19

healthy

0.18

 Healthy

0.18

åģ¥åº·

0.17

 alert

0.15

happy

0.15

 content

0.15

 well

0.15

 HEALTH

0.15

Activations Density 0.109%