INDEX

Explanations

questions and affirmations

Sentences that offer help, ask whether the user wants something, or invite participation (questions, offers, engagement prompts).

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Negative Logits

Token                 Logit
" implies"            0.47
"所以"                0.44   (Chinese: "so, therefore")
" conclusion"         0.44
" derived"            0.43
" extraneous"         0.43
" misrepresented"     0.43
" insignificant"      0.42
" negligible"         0.42
"哪些"                0.41   (Chinese: "which ones")
" nodal"              0.41

Positive Logits

Token                 Logit
"Yes"                 0.87
"yes"                 0.76
"Yes"                 0.76
" Yeah"               0.71
"yes"                 0.70
"YES"                 0.70
" Sure"               0.66
                      0.64
"YES"                 0.63
" হ্যাঁ"               0.61   (Bengali: "yes")

Activations Density 0.173%
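
To give a sense of scale for the density figure, the sketch below converts it into an approximate token count. It assumes, without confirmation from the page, that the density is the fraction of tokens with nonzero activation and that it was measured over the dashboard prompt set listed under Configuration. The function name `estimate_active_tokens` is hypothetical and used only for illustration.

```python
def estimate_active_tokens(n_prompts: int, tokens_per_prompt: int,
                           density_per_100k: int) -> tuple[int, int]:
    """Return (total_tokens, estimated_active_tokens).

    Assumption: density is the share of tokens on which the feature
    fires, expressed here in parts per 100,000 so that the arithmetic
    stays in integers (0.173% == 173 per 100,000).
    """
    total_tokens = n_prompts * tokens_per_prompt
    active_tokens = total_tokens * density_per_100k // 100_000
    return total_tokens, active_tokens


# Figures taken from the Configuration section and the density line.
total_tokens, active_tokens = estimate_active_tokens(238_145, 512, 173)
print(f"total tokens:     {total_tokens:,}")
print(f"estimated active: {active_tokens:,}")
```

Under these assumptions the feature would fire on roughly two tokens in every thousand, which fits a feature tied to a specific conversational move rather than to general text.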