INDEX
    Negative Logits
    " lezen"       -0.08
    " reading"     -0.08
    "Uk"           -0.08
    " hup"         -0.08
    " ọsọ"         -0.08
    "apk"          -0.08
    " Reading"     -0.07
    " scarcity"    -0.07
    "اقتص"         -0.07
    " زن"          -0.07
    Positive Logits
    " meticulously"      0.09
    " detailed"          0.09
    " Hieronder"         0.09
    "data"               0.09
    " detaill"           0.08
    " [{↵"               0.08
    "nested"             0.08
    "[↵"                 0.08
    "construction"       0.08
    " cuidadosamente"    0.08
    Activation Density: 0.010%

    No Known Activations