INDEX

Explanations

mentions of the United States and its abbreviations

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ennen

-0.08

intree

-0.08

ouncer

-0.07

htub

-0.07

kop

-0.07

.bunifuFlatButton

-0.07

ÑīÐ¸

-0.07

imers

-0.07

_TER

-0.07

Ã¶ff

-0.07

POSITIVE LOGITS

 Virgin

0.09

 Army

0.07

 Postal

0.07

ual

0.07

utex

0.07

_based

0.06

-based

0.06

vs

0.06

 Naval

0.06

 Department

0.06

Activations Density 0.028%