INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
    an
    0.88
    at
    0.82
    ğini
    0.79
    ل
    0.79
    zelfde
    0.77
    n
    0.73
    ن
    0.70
    nione
    0.69
    al
    0.68
    or
    0.67
    POSITIVE LOGITS
    '
    0.77
    0.67
    \
    0.65
    $
    0.65
     than
    0.61
     will
    0.60
     be
    0.59
     takes
    0.59
     calipers
    0.58
    0.58
    Act Density 0.534%

    No Known Activations