INDEX

Explanations

dates and years

mentions of dates or date-related phrases (e.g., years, months, "current date", "knowledge cutoff").

New Auto-Interp

Top Features by Cosine Similarity

Configuration

Prompts (Dashboard)

10,000 prompts, 128 tokens each

Dataset (Dashboard)

lmsys/lmsys-chat-1m

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

-0.09

Hod

-0.09

 Landing

-0.09

Lor

-0.08

 Morr

-0.08

POSITIVE LOGITS

âĸłâĸł

0.11

Ð¿ÑĢÐ¸Ð¼ÐµÑĢ

0.09

toa

0.09

 Schultz

0.09

.UserInfo

0.09

ccess

0.09

ellers

0.08

 Cassidy

0.08

ynet

0.08

 Chat

0.08

Activations Density 0.012%