INDEX

Explanations

the term "fan" in different contexts

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

===============

-0.73

 gaun

-0.65

Nex

-0.63

kj

-0.62

 Whittaker

-0.61

///////////////

-0.60

שה

-0.59

 fris

-0.59

bé

-0.58

herin

-0.57

POSITIVE LOGITS

fan

1.64

Fan

1.64

 fans

1.59

FAN

1.59

Fan

1.55

 Fans

1.55

fan

1.53

FAN

1.48

fans

1.46

Fans

1.45

Activations Density 0.013%