INDEX

Explanations

aggressive nature, behaviors, treatments

The neuron is looking for the adjective "aggressive" and related terms describing intensity or severity.

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

\"

-3.17

-3.00

-2.98

we

-2.91

``

-2.81

-2.72

↵↵

-2.70

-2.67

-2.55

-2.53

POSITIVE LOGITS

愎

2.95

Важно

2.77

𓄹

2.72

鋮

2.53

啦

2.52

しており

2.50

地说

2.48

ጮ

2.42

欉

2.42

笈

2.41

Activations Density 0.003%