INDEX
    Explanations

    small concepts or words

    New Auto-Interp
    Negative Logits
    उनके
    0.39
     Pathway
    0.38
     pathway
    0.38
     humains
    0.37
    Celestial
    0.37
     pith
    0.37
     gentil
    0.37
    portions
    0.37
     guid
    0.36
     portions
    0.36
    POSITIVE LOGITS
     calls
    0.45
     کوچک
    0.45
     small
    0.43
     SMALL
    0.42
     küçük
    0.41
     маленький
    0.41
     arbejde
    0.41
     tareas
    0.40
     عمل
    0.40
     arbete
    0.40
    Act Density 0.001%

    No Known Activations