INDEX

Explanations

negative criticism or judgment

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 unsatisfactory

-1.05

onnen

-0.96

Composite

-0.88

 étrange

-0.85

ष्ठ

-0.84

 deplorable

-0.83

 sentire

-0.83

 panini

-0.83

 cess

-0.82

 pade

-0.82

POSITIVE LOGITS

 criticism

1.33

 defensive

1.14

 critic

1.09

 critical

1.04

 criticized

1.01

judgment

1.01

 gossip

1.00

 boss

0.97

 critici

0.96

critic

0.95

Activations Density 0.027%