INDEX

Explanations

Reducing

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Reducing

-0.82

reducing

-0.79

reduction

-0.76

Reduction

-0.72

 Reducing

-0.58

 REDUC

-0.58

 REDUCTION

-0.58

ParallelGroup

-0.55

Reduced

-0.55

 defaultstate

-0.54

POSITIVE LOGITS

 espirituales

0.64

0.63

getDescription

0.59

 militaires

0.59

 médicas

0.57

 Dosen

0.53

irot

0.53

前言

0.52

 églises

0.52

sen

0.52

Activations Density 0.107%