INDEX
    Explanations

    not interacting or not happening

    New Auto-Interp
    Negative Logits
     rapidement
    0.46
    まさに
    0.46
    both
    0.45
     различными
    0.45
     galore
    0.45
     both
    0.44
     다양한
    0.44
     excelente
    0.44
     প্রতিটি
    0.44
     whatnot
    0.44
    POSITIVE LOGITS
     anymore
    1.08
    任何
    0.84
     sondern
    0.79
     nor
    0.73
     إلا
    0.73
     alcuna
    0.72
     anything
    0.71
     siquiera
    0.71
     alcun
    0.71
    也不会
    0.70
    Act Density 0.042%

    No Known Activations