INDEX
    Explanations
    Negative Logits
        Token               Logit
        " then"             -0.55
        "/-/"               -0.53
        " I"                -0.50
        " himo"             -0.48
        " jScrollPane"      -0.47
        "CrossRef"          -0.47
        "WithMany"          -0.46
        " i"                -0.45
        " ErrIntOverflow"   -0.45
        "vignon"            -0.45
    Positive Logits
        Token               Logit
        " meglio"            0.76
        "better"             0.73
        " meilleure"         0.72
        " better"            0.67
        "mejor"              0.66
        " meilleur"          0.66
        " migliore"          0.65
        " besseren"          0.65
        " mejor"             0.63
        " melhor"            0.63
    Act Density 0.004%

    No Known Activations