INDEX

Explanations

fake or surrogate

The neuron is highly responsive to personal‐reference words—especially first‐person pronouns and analogous kinship or self-referential terms.

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

eğ

0.74

Projectile

0.71

Attack

0.68

 meneg

0.68

Motivational

0.67

오

0.67

Histogram

0.66

 motorcycl

0.66

 sistemat

0.65

Weapon

0.64

POSITIVE LOGITS

 surrog

0.98

 surrogate

0.98

 Surrogate

0.86

 transf

0.84

 fraudulently

0.82

偽

0.80

 अनजान

0.77

 identities

0.77

 transplantation

0.77

 جعلی

0.75

Activations Density 1.710%