INDEX

Explanations

listing entity names

New Auto-Interp

Top Features by Cosine Similarity

Configuration

Prompts (Dashboard)

10,000 prompts, 128 tokens each

Dataset (Dashboard)

lmsys/lmsys-chat-1m

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 synonyms

-0.11

 åŃĹ

-0.09

 verbs

-0.09

exels

-0.09

dictionary

-0.09

 subtitle

-0.08

shint

-0.08

 nghÄ©a

-0.08

Laf

-0.08

antry

-0.08

POSITIVE LOGITS

 mention

0.22

 entity

0.21

 mentioned

0.20

 Mention

0.19

 mentions

0.19

 proper

0.19

 entities

0.18

 Proper

0.17

 names

0.17

mentioned

0.16

Activations Density 0.243%