INDEX

Explanations

authority and dominance

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

icle

0.97

 crock

0.95

 desert

0.94

tep

0.93

 mishap

0.93

反応

0.90

 leak

0.90

 reaction

0.90

沥

0.89

 reacted

0.87

POSITIVE LOGITS

authority

1.62

Authority

1.58

 authority

1.58

 dominion

1.52

commands

1.46

 কর্তৃত্ব

1.46

 hegemony

1.45

Control

1.44

 contrôle

1.40

control

1.39

Activations Density 0.464%