INDEX
    Explanations
    Negative Logits
    Token     Value
    "та"      0.99
    "is"      0.98
    "t"       0.95
    "א"       0.87
    "ك"       0.79
    "я"       0.76
    "the"     0.74
    " in"     0.73
    "it"      0.73
    "га"      0.72
    Positive Logits
    Token           Value
    (blank)         0.95
    (blank)         0.76
    " दो"           0.71
    " valore"       0.70
    " aspetti"      0.68
    (blank)         0.68
    " beaker"       0.65
    " pernyataan"   0.65
    " biomarker"    0.64
    " antigen"      0.63
    Activation Density 0.000%

    No Known Activations