    NEGATIVE LOGITS
    "$+"                 0.52
    "리와"               0.50
    "\%,"                0.50
    " insectes"          0.50
    "رسٹ"                0.49
    "0"                  0.48
    (token not shown)    0.48
    "[{\"                0.47
    "$>$"                0.47
    "$>"                 0.47
    POSITIVE LOGITS
    " be"                0.84
    " de"                0.75
    "t"                  0.75
    " can"               0.72
    "de"                 0.71
    "in"                 0.70
    "n"                  0.65
    " in"                0.62
    " fueled"            0.60
    "ли"                 0.58
    Activation Density 0.092%

    No Known Activations