Explanations

influences or impacts

This neuron flags broad evaluative or emphasis phrases—especially ones that stress importance or what “counts” (e.g. “of equal importance,” “what counts,” “what is most important”).

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Token (quoted to show leading spaces)    Logit
" Morgan"       0.57
" postup"       0.55
" अशी"          0.55
" Dana"         0.55
" Whitley"      0.53
"Easing"        0.53
"却没有"         0.52
" Fraser"       0.52
" altına"       0.52
" Sriniv"       0.52

Positive Logits

Token (quoted to show leading spaces)    Logit
" important"    2.81
"important"     2.78
"重要"           2.63
" 중요"          2.61
" importante"   2.55
" مهم"          2.50
" важли"        2.50
" Important"    2.48
" belangrijk"   2.48
" penting"      2.45

Activations Density 0.652%
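
To make the density figure concrete, here is a minimal sketch. It assumes that activation density means the fraction of token positions where the neuron's activation is above zero, and that it is measured over the 392,802 prompts of 256 tokens listed under Configuration. Neither assumption is confirmed by this page. The function name `activation_density`, the threshold of 0.0, and the toy activation list are hypothetical and for illustration only.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" is the share of tokens on which the neuron fires.
    """
    total = len(activations)
    if total == 0:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / total


# Token budget implied by the configuration: prompts x tokens per prompt.
n_prompts = 392_802
tokens_per_prompt = 256
total_tokens = n_prompts * tokens_per_prompt

# Hypothetical toy activations: one firing position out of four.
toy_activations = [0.0, 0.0, 1.3, 0.0]
toy_density = activation_density(toy_activations)

# Rough count of active tokens if the reported 0.652% applies to the full budget.
reported_density = 0.652 / 100
estimated_active_tokens = round(reported_density * total_tokens)

print(total_tokens, toy_density, estimated_active_tokens)
```

Under these assumptions the configuration covers about 100.6 million tokens, so a density of 0.652% would correspond to roughly 656,000 token positions with a nonzero activation.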