Explanations

alert levels or threat conditions

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

"uttgart"  -0.81
"šenie"  -0.75
"ക"  -0.75
" ravages"  -0.73
" insectic"  -0.71
"有时候"  -0.71
" една"  -0.71
" vertigo"  -0.71
" bestellt"  -0.70
" indistinct"  -0.69

Positive Logits

" alert"  3.20
"alert"  2.61
" Alert"  2.42
"Alert"  2.17
" alerts"  2.08
" alerta"  1.95
" Level"  1.80
" ALERT"  1.77
" yellow"  1.67
" advisory"  1.64

Activations Density 0.032%