INDEX

Explanations

crime and illicit activities

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 consonant

0.96

 vowel

0.88

 print

0.83

newLine

0.82

 philosopher

0.81

꼰

0.79

 Print

0.79

reducible

0.76

endocrine

0.76

 chồng

0.75

POSITIVE LOGITS

 heist

1.93

 theft

1.87

 thefts

1.80

 Theft

1.79

 stealing

1.71

 thieves

1.65

 thief

1.62

 steal

1.58

 steals

1.54

盗

1.49

Activations Density 0.309%