INDEX

Explanations

forms of harm

The neuron fires on archaic (Early Modern English) present‐tense verb forms ending in “-eth.”

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

Lastly

0.76

 voluminous

0.72

∞</

0.71



0.70

 contentious

0.68

())->

0.68

当該

0.68

 nombrado

0.67

আছে

0.66

 অভ্য

0.66

POSITIVE LOGITS

 Jans

0.84

0.79

 nahin

0.78

Jel

0.76

arie

0.74

erts

0.73

yawa

0.73

ません

0.72

 Bukan

0.72

Activations Density 0.001%