    Explanations

    specific examples and listings within various contexts

    Negative Logits
    Token               Logit
    "wiście"            -0.63
    " tartalomajánló"   -0.60
    " équip"            -0.54
    " morada"           -0.54
    " moschino"         -0.53
    " Efq"              -0.51
    "zewod"             -0.50
    "quela"             -0.49
    "fois"              -0.49
    "Bretagne"          -0.49
    Positive Logits
    Token         Logit
    " include"    1.42
    " includes"   1.22
    " included"   1.04
    " Include"    1.00
    "include"     1.00
    " Includes"   0.96
    " INCLUDE"    0.93
    " Included"   0.91
    "includes"    0.91
    "Include"     0.88
    Activation Density: 0.418%

    No Known Activations