INDEX

Explanations

environment and health

references to environmental impact or environmental harm.

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

to

-1.14

for

-1.04

方がいい

-0.99

necedor

-0.94

 между

-0.91

ほうがいい

-0.91

utkan

-0.87

 oscuros

-0.87

americas

-0.85

 kreeg

-0.84

POSITIVE LOGITS

by

1.07

特别是

1.05

something

1.05

 продъл

1.03

1.00

sol

1.00

gart

0.94

럽

0.92

 when

0.91

 long

0.90

Activations Density 0.041%