INDEX

Explanations

KKK, Nazi, white supremacy, Tea Party

The neuron flags references to extremist or hate groups (e.g. the KKK, the Tea Party, neo-Nazis, white supremacists, lynch mobs).

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Token          Logit
(not shown)    -1.98
(not shown)    -1.72
(not shown)    -1.66
(not shown)    -1.62
"/*"           -1.59
"Enjoy"        -1.59
(not shown)    -1.57
(not shown)    -1.55
"Also"         -1.55
"Do"           -1.55

Positive Logits

Token            Logit
" fervor"        1.80
" lynch"         1.73
"MOVIE"          1.73
"鉝"             1.60
" alimentare"    1.60
" fundament"     1.58
" spati"         1.58
"zelfde"         1.52
" technic"       1.48
" comanda"       1.46

Activations Density 0.014%
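
To give the density figure a sense of scale, the sketch below converts it into an approximate token count using the prompt configuration listed above. It assumes that "Activations Density" means the share of tokens in the dashboard prompts on which the neuron has a non-zero activation; the page does not state this definition. The function names are hypothetical and chosen only for illustration.

```python
# Rough scale check for the activation density figure.
# Assumption (not stated on the page): density = fraction of tokens in the
# dashboard prompts on which the neuron has a non-zero activation.

N_PROMPTS = 24_576         # from "Prompts (Dashboard)"
TOKENS_PER_PROMPT = 128    # from "Prompts (Dashboard)"
DENSITY_PERCENT = 0.014    # from "Activations Density"


def total_tokens(n_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of tokens across all dashboard prompts."""
    return n_prompts * tokens_per_prompt


def approx_active_tokens(n_tokens: int, density_percent: float) -> int:
    """Approximate number of tokens on which the neuron activates."""
    return round(n_tokens * density_percent / 100)


if __name__ == "__main__":
    n = total_tokens(N_PROMPTS, TOKENS_PER_PROMPT)
    print(n)                                         # 3145728
    print(approx_active_tokens(n, DENSITY_PERCENT))  # 440
```

Because the displayed density is rounded to three decimal places, the result is only approximate: under this reading the neuron fires on roughly 440 of about 3.1 million tokens.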