INDEX

Explanations

itself, themselves, often reflexive

paradoxes, ironic statements, and self-referential observations where situations contain internal contradictions or unexpected inversions.

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

 עם

0.42

 অস্বাভাবিক

0.42

 因为

0.41

 abnorm

0.40

我不

0.40

 reciente

0.40

我

0.40

 বা

0.39

 absolutamente

0.39

 నాకు

0.39

POSITIVE LOGITS

 itself

0.60

 stessi

0.57

 themselves

0.54

本身的

0.53

 stesse

0.50

本身

0.43

 сами

0.43

த்திலேயே

0.42

 mismos

0.42

 during

0.40

Activations Density 0.058%