INDEX

Explanations

spy, agent, secret service

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 herd

0.45

 দৈত্য

0.45

 migrate

0.40

 weld

0.40

喳

0.39

㖑

0.39

 गोबर

0.39

벅

0.39

ぞ

0.38

㝅

0.38

POSITIVE LOGITS

spy

1.88

 espionage

1.81

 spies

1.74

spy

1.51

Spy

1.48

 agent

1.45

 agents

1.45

 spying

1.45

 Agent

1.42

Spy

1.42

Activations Density 0.042%