INDEX
Explanations
conversational interactions and inquiries involving shared experiences or questions
New Auto-Interp
Negative Logits
بيها
-0.79
للمعارف
-0.69
متعلقه
-0.64
Czytaj
-0.59
ślę
-0.57
ihnachten
-0.56
Identyfik
-0.56
+#+#
-0.55
unanje
-0.54
antaranya
-0.52
POSITIVE LOGITS
?!
0.84
?!?
0.78
?)
0.78
?),
0.73
!?
0.71
?).
0.68
?"
0.68
?!?!
0.67
?:
0.67
?!"
0.67
Activations Density 0.151%