INDEX
    Explanations

    parentheses followed by common conjunctions

    New Auto-Interp
    Negative Logits
    (
    0.64
    0.61
    ر
    0.59
     to
    0.59
    <0x80>
    0.58
     (
    0.53
    ح
    0.52
    ר
    0.52
     be
    0.50
    AR
    0.45
    POSITIVE LOGITS
    u
    0.83
    a
    0.71
    i
    0.60
    z
    0.60
    ى
    0.58
    the
    0.57
    0.57
    0.57
    ة
    0.54
    0.51
    Act Density 0.354%

    No Known Activations