INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
    InjectAttribute
    -0.63
    تقاوى
    -0.63
    '];?>
    -0.61
    ."]
    -0.61
     unſ
    -0.61
    ...");
    -0.60
    ']?>
    -0.60
     :");
    -0.59
     ."
    -0.59
    ...")
    -0.58
    POSITIVE LOGITS
    ,
    0.84
    %,
    0.64
    ,<
    0.62
    $,
    0.61
    0.60
    #,
    0.57
    ،
    0.57
     \%,
    0.55
    ++,
    0.53
     %,
    0.53
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.