Explanations

health risks and conditions

content about health hazards, injuries, and diseases that can cause harm to humans.

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Token       Logit
"to"        -1.44
"耵"        -1.44
" selle"    -1.23
"phoenix"   -1.20
" Dicht"    -1.20
" Werken"   -1.18
" AMONG"    -1.17
" mulai"    -1.17
"吠"        -1.17
"uesday"    -1.16

Positive Logits

Token       Logit
" like"     1.40
"这两个"    1.40
"Like"      1.36
"These"     1.30
"Another"   1.30
"Both"      1.27
"これらの"  1.26
" dichos"   1.25
"Such"      1.24
" albeit"   1.22

Activations Density 0.115%
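
To put the density figure in context, the sketch below combines it with the prompt configuration listed above. It assumes that "Activations Density" means the fraction of dashboard tokens on which the feature has a nonzero activation; the page does not define the term, so treat the result as a rough estimate. The function name `estimate_active_tokens` is hypothetical and used only for illustration.

```python
# Figures quoted from the Configuration section and the density readout.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY = 0.115 / 100  # 0.115% expressed as a fraction


def estimate_active_tokens(num_prompts, tokens_per_prompt, density):
    """Return (total tokens, estimated tokens with nonzero activation).

    Assumption: density is the fraction of all dashboard tokens on which
    the feature fires. This definition is not stated on the page.
    """
    total_tokens = num_prompts * tokens_per_prompt
    return total_tokens, total_tokens * density


total_tokens, active_tokens = estimate_active_tokens(
    NUM_PROMPTS, TOKENS_PER_PROMPT, ACTIVATION_DENSITY
)
print(f"{total_tokens:,} tokens in total")
print(f"about {active_tokens:,.0f} tokens with nonzero activation")
```

Under that assumption, the feature fires on roughly 3,600 of about 3.1 million tokens, which is consistent with a sparse, topic-specific feature.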