INDEX

Explanations

possessives and their associated nouns

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Types

0.37

possibly

0.35

 Possibly

0.34

 posibles

0.33

不错的

0.33

 forskellige

0.33

的一些

0.33

шем

0.32

Specific

0.32

을

0.32

POSITIVE LOGITS

 fingers

0.51

 eyes

0.45

oretically

0.43

 onus

0.43

意思是

0.43

目的是

0.42

性は

0.41

 fingerprints

0.39

 goal

0.39

 문제는

0.39

Activations Density 0.024%