INDEX
    Explanations

    phrases that indicate mandatory actions or emphasize strong emotional expressions

    New Auto-Interp
    Negative Logits
    تقاوى
    -0.89
    })).
    -0.88
    ()]);
    -0.87
    ."]
    -0.85
     ]);
    -0.79
    ."));
    -0.77
    "]);
    
    -0.76
    .');
    -0.75
    FieldBuilder
    -0.75
    .");
    -0.74
    POSITIVE LOGITS
    \{\\
    0.82
     di
    0.56
     “
    0.53
    win
    0.51
    mistic
    0.51
    ised
    0.50
    姆斯
    0.50
    0.49
     Win
    0.47
    iastes
    0.47
    Act Density 0.026%

    No Known Activations