INDEX

Explanations

items with associated metrics or types

domain-specific content keywords, especially concrete nouns that signal the passage’s main topic or subject.

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

 subjug

0.36

 testes

0.34

 multiplic

0.32

 Fäh

0.31

 símbolo

0.30

 බොහෝ

0.30

 plunder

0.30

 figur

0.30

 sasan

0.30

 Bisa

0.30

POSITIVE LOGITS

us

0.33

之类的

0.32

ul

0.31

on

0.30

os

0.29

ers

0.29

ic

0.29

ayın

0.29

ig

0.28

ai

0.28

Activations Density 1.352%