    Explanations

    the beginning of a document or section

    Negative Logits
    Token         Logit
    "انتهای"      -0.63
    " ex"         -0.61
    "*~*~"        -0.58
    " explo"      -0.58
    " gl"         -0.56
    " $=-"        -0.55
    "ixeira"      -0.54
    ".%"          -0.53
    "aville"      -0.53
    "(${"         -0.53
    Positive Logits
    Token             Logit
    "<bos>"           0.76
    " nessuna"        0.69
    " supérieures"    0.67
    " gostar"         0.67
    " producteurs"    0.66
    " rodea"          0.64
    " temprana"       0.61
    " varandra"       0.61
    " nenhuma"        0.60
    " nessuno"        0.60
    Activation Density: 0.550%

    No Known Activations