Explanations
asking questions and offering help
Negative Logits
Token         Value
iqueness      0.87
includes      0.82
itals         0.80
stets         0.80
সর্বত্র        0.79
往往          0.78
重要          0.78
이때          0.76
अक्सर         0.76
posteriores   0.76
Positive Logits
Token         Value
trivia        1.18
jokes         1.14
joke          1.12
Jokes         1.10
quiz          1.10
Trivia        1.09
chat          1.08
Joke          1.08
riddle        1.07
brainstorm    1.03
Activations Density 2.236%