INDEX
    Explanations

    emotional expressions and reactions

    New Auto-Interp
    Negative Logits
    providedIn
    -0.96
     <=",
    -0.95
    tvguidetime
    -0.82
     EconPapers
    -0.78
    MemoryWarning
    -0.78
    WebElementEntity
    -0.78
    ?}",
    -0.77
     sumpay
    -0.76
    المشاركات
    -0.74
    Datuak
    -0.74
    POSITIVE LOGITS
    ↵↵
    0.79
    0.68
    以上
    0.61
    <eos>
    0.60
     以上
    0.57
     These
    0.56
    <h3>
    0.53
    These
    0.52
      
    0.50
    <h2>
    0.49
    Act Density 0.033%

    No Known Activations