INDEX

Explanations

expressions of deceit or falsehood

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 AssemblyTitle

-0.73

djangoproject

-0.69

WriteTagHelper

-0.65

AddTagHelper

-0.61

antart

-0.61

ССР

-0.57

omiast

-0.56

urally

-0.56

oweit

-0.56

コト

-0.55

POSITIVE LOGITS

lied

3.58

lying

1.23

lies

1.22

PLIED

1.17

lieder

0.79

wal

0.67

 Lied

0.65

ply

0.59

 Baum

0.56

liez

0.55

Activations Density 0.001%