INDEX

Explanations

Domestic violence

The neuron activates strongly on commercial, service-oriented keywords, especially music-download titles (“lagu”) and moving and packing services (“Movers,” “Packers,” “Removals”), which suggests it flags advertisement-style or directory-style terms.

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

is  0.91
il  0.80
スナー  0.79
li  0.78
Royal  0.75
Storia  0.73
ir  0.73
é  0.72
ort  0.70

Positive Logits

ческую  0.92
ното  0.86
нома  0.84
 повы  0.82
ческому  0.81
 annih  0.80
ственной  0.79
穸  0.79
 некоторых  0.78
йт  0.77

Activations Density 0.001%
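
To put the density figure in context, here is a rough sketch of the arithmetic. It assumes that the density was measured over the same prompt set listed under Configuration (392,802 prompts of 256 tokens each), which the page does not state explicitly, and that the displayed 0.001% is a rounded value. The variable names are illustrative only.

```python
# Figures quoted on the dashboard.
num_prompts = 392_802
tokens_per_prompt = 256
density_percent = 0.001  # displayed value; probably rounded

# Assumption: density = (tokens with non-zero activation) / (all tokens seen).
total_tokens = num_prompts * tokens_per_prompt
approx_active_tokens = total_tokens * density_percent / 100

print(f"total tokens: {total_tokens:,}")
print(f"approx. activating tokens: {approx_active_tokens:,.0f}")
```

Under these assumptions the feature fires on only about a thousand of roughly a hundred million tokens, so it is very sparse. Because the displayed percentage is rounded, the true count could differ noticeably.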