INDEX

Explanations

patterns of contrast or deviation

The neuron fires strongly on technical specification language—numbers, units, model names, acronyms, and other highly domain‐specific jargon in product or malware descriptions.

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

非常的

1.44

很多的

1.23

 veldig

1.12

 veramente

1.06

 VERY

1.04

真的是

1.03

 väldigt

1.03

 sogenannten

1.02

 শুধুমাত্র

1.02

썻

1.01

POSITIVE LOGITS

 लिहाजा

0.90

Notably

0.89

 പിന്തുണ

0.83

 преимущественно

0.83

 brimming

0.83

 உள்ளிட்ட

0.82

 amid

0.81

 renferme

0.81

ímp

0.80

ほか

0.79

Activations Density 1.155%