    Explanations

    numbers and punctuation

    Negative Logits
    Token           Logit
    " to"           -1.58
    " declaró"      -1.50
    "коля"          -1.48
    "6"             -1.46
    " confirmó"     -1.45
    "кож"           -1.42
    " reveló"       -1.41
    " but"          -1.41
    "之意"          -1.41
    " menyadari"    -1.40
    Positive Logits
    Token           Logit
    " borracha"     1.83
    "我们"          1.66
    " ralla"        1.62
    " their"        1.57
    " two"          1.53
    " her"          1.50
    " one"          1.49
    " him"          1.48
    " three"        1.48
    " seven"        1.47
    Activation Density: 0.008%

    No Known Activations