Explanations

conjunctions and phrases related to decision-making and action

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

Token                  Logit
"oup"                  -0.07
"ump"                  -0.06
"lik"                  -0.05
"备" (raw: å¤ĩ)        -0.05
"ýt" (raw: Ã½t)        -0.05
"備" (raw: åĤĻ)        -0.05
"osphate"              -0.05
"emes"                 -0.05
"ones"                 -0.05
" ones"                -0.05

Positive Logits

Token                  Logit
" happening"           0.08
"atcher"               0.07
"شی" (raw: Ø´ÛĮ)       0.07
"accom"                0.07
"done"                 0.07
" happen"              0.07
"UNDLE"                0.07
" hvordan"             0.07
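
Some tokens in the logit lists appear as strings such as å¤ĩ or Ø´ÛĮ. These look like byte-level BPE display forms, in which each raw byte is shown as a stand-in Unicode character. The page does not name the tokenizer, so the sketch below assumes a GPT-2-style byte-to-unicode table; `decode_token` is a hypothetical helper name, not part of any dashboard API.

```python
def bytes_to_unicode() -> dict[int, str]:
    """Byte -> display character table in the style of GPT-2 byte-level BPE.

    Assumption: the model behind this dashboard uses this mapping.
    """
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    offset = 0
    for b in range(256):
        if b not in printable:
            # Non-printable bytes are shifted to code points starting at U+0100.
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(display: str) -> str:
    """Turn a displayed token string back into readable text."""
    raw = bytearray()
    for ch in display:
        if ch in UNICODE_TO_BYTE:
            raw.append(UNICODE_TO_BYTE[ch])
        else:
            # Characters outside the table (e.g. a literal space) pass through.
            raw.extend(ch.encode("utf-8"))
    # A token may be a fragment of a multi-byte character, so do not fail hard.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for shown in ["å¤ĩ", "Ã½t", "åĤĻ", "Ø´ÛĮ", " ones"]:
        print(repr(shown), "->", repr(decode_token(shown)))
```

Plain ASCII tokens such as "oup" or " happening" pass through unchanged, so the helper can be applied to every row of both lists.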

Activations Density: 0.015%
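
To put the density figure in context, the short calculation below estimates how many tokens in the dashboard sample the feature fires on. It assumes that the density is the fraction of tokens with nonzero activation, measured over the 24,576 prompts of 128 tokens listed under Configuration; the page does not state this definition explicitly.

```python
# Values taken from the Configuration and density lines of this page.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.015

# Assumption: density = active tokens / total tokens in the dashboard sample.
total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT
expected_active = total_tokens * DENSITY_PERCENT / 100

print(f"{total_tokens:,} tokens in the sample")
print(f"roughly {expected_active:.0f} tokens with nonzero activation")
```

Under that assumption the feature is sparse: it activates on a few hundred tokens out of roughly three million.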