Explanations

distances and units

The neuron fires strongly on tokens that appear in vacation-rental or hotel listing descriptions, such as amenities, unit details, distances, and other accommodation-ad language.

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

"fuls"  -0.79
" simile"  -0.77
" gigante"  -0.72
"AllAfrica"  -0.71
"ések"  -0.71
" marco"  -0.70
"Simulator"  -0.70
"sole"  -0.69
" статье"  -0.69
"linie"  -0.69

Positive Logits

" accommodations"  0.84
"颌"  0.74
" Indians"  0.73
" attracting"  0.69
" keen"  0.68
" anpassen"  0.68
"力が"  0.66
"🅴"  0.66
"লি"  0.65
" atraer"  0.65

Activations Density: 0.098%
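
To give a sense of scale for the density figure, the short sketch below converts it into an approximate token count. It assumes, and this page does not confirm it, that the density is the share of dashboard tokens on which the neuron has a nonzero activation, measured over the 24,576 prompts of 128 tokens each listed under Configuration. The function name is hypothetical and chosen only for illustration.

```python
# Figures copied from this page.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.098  # reported activation density, in percent


def estimated_active_tokens(num_prompts, tokens_per_prompt, density_percent):
    """Return (total tokens, estimated tokens with nonzero activation).

    Assumption: density is the fraction of all dashboard tokens on which
    the neuron activates. The page does not state how it is computed.
    """
    total_tokens = num_prompts * tokens_per_prompt
    active_tokens = total_tokens * density_percent / 100.0
    return total_tokens, active_tokens


total, active = estimated_active_tokens(
    NUM_PROMPTS, TOKENS_PER_PROMPT, DENSITY_PERCENT
)
print(f"{total:,} tokens in total, roughly {active:,.0f} with nonzero activation")
```

Under that assumption the neuron is active on only a few thousand of roughly three million tokens, which fits a narrowly scoped explanation such as accommodation-listing language.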