Explanations

terms related to online security and spam detection

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

Token       Logit
"oop"       -0.07
"º"         -0.07
"oppel"     -0.07
"IDER"      -0.07
"ider"      -0.07
"esser"     -0.07
"Fet"       -0.07
"ç¾"        -0.06
"ataire"    -0.06
"ailable"   -0.06

Positive Logits

Token       Logit
"bot"       0.10
"bot"       0.09
"bots"      0.08
" bots"     0.08
"(bot"      0.07
"-bot"      0.07
" robot"    0.07
" robots"   0.07
"çľ"        0.07
".bot"      0.07

Activations Density 0.005%
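
To put the density figure in context, the short sketch below works out how many token positions the configuration covers and roughly how many of them the feature would fire on. It assumes that "Activations Density" means the fraction of token positions with a nonzero activation; the dashboard does not state this definition, so treat it as an assumption. The constant and function names are illustrative only.

```python
# Values copied from the Configuration and Activations Density entries above.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.005  # displayed value, presumably rounded


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total token positions covered by the dashboard prompts."""
    return num_prompts * tokens_per_prompt


def approx_active_tokens(total: int, density_percent: float) -> float:
    """Approximate number of positions where the feature is active.

    Assumption: density is the fraction of token positions with a
    nonzero activation, expressed as a percentage.
    """
    return total * density_percent / 100.0


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = approx_active_tokens(total, DENSITY_PERCENT)

print(f"total token positions: {total:,}")
print(f"approx. active positions: {active:.0f}")
```

Under that assumption, the feature is active on only a tiny share of the sampled positions, which is consistent with a narrow feature tied to "bot"-like tokens. Because the displayed percentage is rounded, the count is an estimate rather than an exact figure.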