INDEX
    Explanations

    expressions of surprise or shock

    New Auto-Interp
    Negative Logits
     NSCoder
    -0.70
     Италијани
    -0.68
    EndGlobalSection
    -0.68
     للاسماء
    -0.67
    出版年
    -0.63
     ویکی‌آمباردا
    -0.63
     الحره
    -0.62
     otomatig
    -0.60
    Tikang
    -0.57
    Hentet
    -0.57
    POSITIVE LOGITS
     smiled
    0.61
     grinned
    0.48
     stared
    0.48
     sighed
    0.45
     nodded
    0.43
     smirked
    0.41
     glanced
    0.40
     shrugged
    0.39
     frowned
    0.39
     smile
    0.39
    Act Density 0.385%

    No Known Activations