INDEX
    Explanations

    approximate

    New Auto-Interp
    Negative Logits
    ті
    -0.08
    ткен
    -0.08
     дар
    -0.08
     দর্শ
    -0.08
     доказ
    -0.08
     ביז
    -0.08
    -0.08
    -0.08
     등장
    -0.08
     ұйымда
    -0.08
    POSITIVE LOGITS
     approx
    0.12
     approximate
    0.12
    approx
    0.12
     approximately
    0.11
     aproximadamente
    0.10
     yaklaşık
    0.10
     Approx
    0.10
     approximation
    0.10
    Approx
    0.10
     sekitar
    0.10
    Act Density 0.026%

    No Known Activations