INDEX

Explanations

body parts, negation, observation

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 ചെറിയ

0.54

 छोटी

0.43

ınıza

0.42

 vrouwen

0.40

 pequeñas

0.40

cellent

0.39

 США

0.39

 slightly

0.39

스

0.38

 chhoti

0.37

POSITIVE LOGITS

 omnip

0.46

 endem

0.43

 tưởng

0.40

 столь

0.40

 eterno

0.39

 ostensibly

0.39

 eigens

0.39

 eternally

0.38

 transcendental

0.38

 crainte

0.38

Activations Density 0.035%