INDEX

Explanations

possible to guess

The neuron spots occurrences of the adjectives “possible” or “impossible” (often with the following “to”), i.e. expressions of possibility or impossibility.

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

Etimología

-1.78

࿆

-1.67

裒

-1.67

佧

-1.66

っそ

-1.66

 gustado

-1.64

רום

-1.64

 traducciones

-1.64



-1.64

 gebruikt

-1.63

POSITIVE LOGITS

2.78

勿論

1.59

at

1.59

</h4>

1.55

 một

1.52

 October

1.48

re

1.46

naments

1.41

 August

1.37

Activations Density 0.004%