Explanations

anger and frustration

The presence of strong negative emotional or anger-related language, indicating high emotional intensity.

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1


Negative Logits

" Embedding"    0.44
" Spec"         0.42
"Spec"          0.42
" அதிசய"        0.42
"瞞"            0.41
"ייש"           0.40
"ifo"           0.39
" nanom"        0.39
"weak"          0.38
" چاندی"        0.38

Positive Logits

" anger"        2.23
" angry"        2.22
" angrily"      2.05
"愤怒"          1.94
" colère"       1.91
" गुस्से"        1.82
" enraged"      1.80
" Anger"        1.77
"怒"            1.77
" rage"         1.76

Activations Density 0.069%
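
As a rough sanity check on scale, the configuration figures above can be combined with the activation density. The sketch below assumes that the density is the fraction of all dashboard tokens on which the feature activates, and that every prompt contributes exactly 512 tokens; neither assumption is confirmed by the page, and the function name is purely illustrative.

```python
def implied_active_tokens(num_prompts: int, tokens_per_prompt: int,
                          density_percent: float) -> tuple[int, float]:
    """Return (total tokens, implied number of tokens where the feature fires).

    Assumption: density_percent is a percentage of all tokens in the
    dashboard prompt set. This is an interpretation, not a documented fact.
    """
    total = num_prompts * tokens_per_prompt
    active = total * density_percent / 100.0
    return total, active


# Figures taken from the Configuration and Activations Density lines above.
total_tokens, active_tokens = implied_active_tokens(238_145, 512, 0.069)
print(f"total tokens: {total_tokens:,}")
print(f"implied active tokens: {active_tokens:,.0f}")
```

Under these assumptions the feature would fire on roughly one token in every 1,450, which is consistent with a sparse, topic-specific feature.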