Explanations

Okay, positive affirmation

The neuron fires on emphatic, exclamatory praise: strong positive evaluations, often accompanied by exclamation marks.

Configuration

Prompts (Dashboard): 392,802 prompts, 256 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token            Logit
" anyone"        0.63
"悲"             0.61
" nightmare"     0.60
" воспомина"     0.60
"的日子"         0.60
" ненави"        0.60
" dreadful"      0.58
" heartbreak"    0.57
" nightmares"    0.55
" TODAY"         0.55

Positive Logits

Token                Logit
" commendable"       1.72
" kudos"             1.55
" applaud"           1.53
" commended"         1.48
" Kudos"             1.45
" admirable"         1.42
" congratulate"      1.39
" applauded"         1.39
" commend"           1.38
" congratulations"   1.36

Activations Density: 0.363%
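
To put the density figure in context, the short sketch below works out how many token positions the dashboard covers and roughly how many of them the neuron would be active on. It assumes that activation density means the share of token positions with a nonzero activation, and that every prompt contributes exactly 256 tokens; neither assumption is confirmed by the page itself, and the variable names are illustrative only.

```python
# Figures copied from the dashboard configuration and density readout.
NUM_PROMPTS = 392_802
TOKENS_PER_PROMPT = 256
DENSITY_PERCENT = 0.363

# Total token positions, assuming every prompt is exactly 256 tokens long.
total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT

# Assumption: density = share of token positions with nonzero activation.
estimated_active_tokens = round(total_tokens * DENSITY_PERCENT / 100)

print(f"total tokens: {total_tokens:,}")  # total tokens: 100,557,312
print(f"estimated active tokens: {estimated_active_tokens:,}")
```

Under these assumptions the neuron is active on roughly 365,000 of about 100.6 million token positions, which fits a sparse feature tied to a narrow class of praise tokens.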