INDEX
    Explanations

    was found or discovered

    New Auto-Interp
    Negative Logits
     oppression
    0.78
     혹은
    0.77
     Something
    0.74
    Something
    0.72
     কিংবা
    0.70
     something
    0.69
     persecution
    0.69
    似的
    0.69
    をも
    0.68
     যারা
    0.67
    POSITIVE LOGITS
     reportedly
    0.94
     found
    0.88
     encontrado
    0.85
     único
    0.84
     samarbe
    0.83
     unico
    0.82
     sipping
    0.81
     encontrada
    0.80
    💄
    0.80
     hanya
    0.79
    Act Density 0.008%

    No Known Activations