INDEX

Explanations

pornography and sexual acts

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ق

1.52

 promov

1.34

ຕ້ອງ

1.26

ीडी

1.23

ใน

1.23

ในช่วง

1.18

𝑻

1.18

იან

1.16

 един

1.16

 empate

1.14

POSITIVE LOGITS

1.23

ভাবে

1.20

1.16



1.16

 wares

1.15

 subtracted

1.14

 other

1.12

 penetrated

1.09

 kelamin

1.08

 gerais

1.07

Activations Density 0.036%