Explanations

if properly

This neuron activates on first- and second-person pronouns and related self-/direct-address phrases (e.g., “I,” “you,” “as long as,” “let myself”).

Configuration

Prompts (Dashboard): 392,802 prompts, 256 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Tokens are quoted so that a leading space stays visible.

" เพื่อ"  0.77
" ដើម្បី"  0.72
" để"  0.70
" щоб"  0.70
" чтобы"  0.69
"เพื่อให้"  0.68
"เพื่อ"  0.67
" endangering"  0.65
" να"  0.65
" इसलिए"  0.63

Positive Logits

Tokens are quoted so that a leading space stays visible.

" условии"  0.73
" correctement"  0.72
" properly"  0.71
"正しい"  0.71
"proper"  0.70
" doğru"  0.68
" بشر"  0.68
" Careful"  0.68
" Properly"  0.67
"Proper"  0.66

Activations Density 0.536%
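
To put the density figure in context, the short Python sketch below combines it with the dashboard prompt configuration listed under Configuration. It assumes, and the page does not confirm this, that the density is the share of dashboard tokens on which the neuron has a nonzero activation. The function names are hypothetical and exist only for illustration.

```python
# Figures copied from the dashboard text.
NUM_PROMPTS = 392_802
TOKENS_PER_PROMPT = 256
ACTIVATION_DENSITY_PERCENT = 0.536


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions across all dashboard prompts."""
    return num_prompts * tokens_per_prompt


def estimated_active_tokens(total: int, density_percent: float) -> float:
    """Rough count of token positions where the neuron fires.

    ASSUMPTION: density is the percentage of dashboard tokens with a
    nonzero activation; the source page does not define it.
    """
    return total * density_percent / 100.0


if __name__ == "__main__":
    total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
    active = estimated_active_tokens(total, ACTIVATION_DENSITY_PERCENT)
    print(f"total tokens: {total:,}")
    print(f"estimated activating tokens: {active:,.0f}")
```

Under that assumption the neuron would fire on roughly one token in every 190, which is sparse enough that the top-logit lists are driven by a small fraction of the dataset.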