    Negative Logits
    λαν  0.32
    一方面  0.31
     agg  0.29
    ंसी  0.29
    ش  0.29
    ስታ  0.28
     drum  0.27
    طف  0.27
     breasts  0.27
    يتر  0.27
    Positive Logits
     همچنین  0.44
     inoltre  0.41
     നിരവധി  0.41
     ayrıca  0.40
     ასევე  0.39
     Ayrıca  0.39
     በተጨማሪ  0.39
     또한  0.38
     আরেকটি  0.37
     Inoltre  0.37
    Activation Density: 0.417%

    No Known Activations