INDEX

Explanations

graphic violence or explicit sexual

The neuron flags content‐heavy, typically multi-syllabic “meaningful” words (i.e. abstract nouns, technical terms, adjectives and other non-function words) rather than common function words.

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

ට්ට

0.72

ilikom

0.65

 Smoothing

0.65

adet

0.62

 subgoal

0.62

 ปุ่ม

0.62

้าม

0.61

幼稚園

0.61

subnet

0.61

मार्ग

0.61

POSITIVE LOGITS

 riches

0.82

 위해

0.76

 gleaned

0.76

 science

0.71

 publice

0.70

 civilisation

0.69

 grâce

0.67

 faisait

0.67

 sciences

0.66

 durant

0.66

Activations Density 0.731%