INDEX

Explanations

phrases indicating denial of wrongdoing or conspiracy theories

Explanatory text revealing the reasoning, justification, or context behind actions, decisions, or policies, often in quoted statements from officials or reports.

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

itler

-0.09

 Uncategorized

-0.09

bject

-0.08

agher

-0.08

..↵↵↵↵

-0.08

buat

-0.08

ãĥ»ãĥ»ãĥ»↵↵

-0.08

Ä±lÄ±ÄŁÄ±yla

-0.07

asal

-0.07

,)↵

-0.07

POSITIVE LOGITS

 [âĢ¦]

0.12

[,]

0.11

%C

0.10

[s

0.10

 [âĢ¦

0.10

[=

0.10

[d

0.09

[_

0.09

 [...]↵↵

0.08

[`

0.08

Activations Density 10.109%