tendency or ability

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 ability

-1.53

 tendency

-1.15

 Ability

-0.98

 abilities

-0.96

Ability

-0.93

Rohy

-0.90

ability

-0.89

 willingness

-0.88

 propOrder

-0.88

 kaarangay

-0.87

POSITIVE LOGITS

0.70

0.65

for

0.56

in

0.53

 seen

0.52

as

0.49

 textStatus

0.48

 named

0.47

on

0.47

Activations Density 0.088%