Explanations

demographic information

Configuration

Prompts (Dashboard): 10,000 prompts, 128 tokens each

Dataset (Dashboard): lmsys/lmsys-chat-1m

Negative Logits

Token             Logit
"surname"         -0.12
" demographics"   -0.11
" surname"        -0.11
"Surname"         -0.11
" demographic"    -0.11
"èĢ"              -0.10
"PHA"             -0.10
" Geography"      -0.09
" Diary"          -0.09
"Î´ÎŃ"            -0.09

Positive Logits

Token             Logit
" religion"       0.18
" height"         0.18
" occupation"     0.17
" Religion"       0.15
" Height"         0.15
" marital"        0.14
" employment"     0.14
" education"      0.13
"-height"         0.13
"Height"          0.13

Activations Density 0.153%
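
As a rough sanity check on the density figure, the sketch below estimates how many dashboard tokens the feature fires on. It assumes (this page does not state it) that activation density means the fraction of tokens with a nonzero activation, and that it was measured over the dashboard set of 10,000 prompts at 128 tokens each. The function name `expected_active_tokens` is hypothetical.

```python
# Assumption: density = fraction of dashboard tokens with a nonzero activation,
# measured over the dashboard prompt set (10,000 prompts x 128 tokens).

def expected_active_tokens(n_prompts: int, tokens_per_prompt: int,
                           density_percent: float) -> float:
    """Estimate the number of tokens on which the feature is active."""
    total_tokens = n_prompts * tokens_per_prompt
    return total_tokens * density_percent / 100.0

total = 10_000 * 128                                   # 1,280,000 tokens
active = expected_active_tokens(10_000, 128, 0.153)    # about 1,958 tokens
print(total, round(active))
```

Under these assumptions the feature is active on roughly 2,000 of 1.28 million tokens, which is consistent with a sparse feature tied to a narrow topic.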