INDEX

Explanations

the word "but" and its variations to highlight contrasting ideas or exceptions

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

eniz

-0.07

 Kaynak

-0.07

 kÄ±l

-0.07

åĽ

-0.07

 åĨĨ

-0.07

 láº¡i

-0.07

enaire

-0.07

Ìģt

-0.07

oti

-0.07

.dump

-0.07

POSITIVE LOGITS

 otherwise

0.08

Anyway

0.08

 nevertheless

0.07

 Anyway

0.07

 still

0.07

 basically

0.07

 Still

0.07

 fine

0.07

 nonetheless

0.06

 Otherwise

0.06

Activations Density 0.037%