INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     حول
    0.45
    भो
    0.44
     attorno
    0.43
     alrededor
    0.42
     around
    0.41
    around
    0.38
     runt
    0.38
    gyz
    0.37
     rundt
    0.37
     ե
    0.36
    POSITIVE LOGITS
    Close
    0.44
     также
    0.42
     neat
    0.42
     also
    0.40
     également
    0.39
     close
    0.38
     thoughtful
    0.38
     también
    0.38
    Clear
    0.38
    )||
    0.38
    Act Density 0.002%

    No Known Activations