INDEX

Explanations

model explaining something

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

bor

0.61

sommets

0.59

мов

0.58

Bos

0.58

 ennemis

0.58

 glande

0.57

 बॉल

0.57

 தோட்ட

0.57

‚¬

0.56

fopen

0.56

POSITIVE LOGITS

 maxX

0.59

请

0.58

 کیفیت

0.57

 Independent

0.53

independent

0.51

 সম্পর্কের

0.51

 muscle

0.50

 independent

0.50

ﻀ

0.50

 искусство

0.49

Activations Density 0.388%