INDEX
    Explanations
    Negative Logits
    " success"          -1.20
    " succeed"          -0.96
    " Success"          -0.91
    " succeeded"        -0.91
    "success"           -0.81
    "succeeded"         -0.80
    " succeeds"         -0.79
    " successes"        -0.77
    "Success"           -0.74
    "succeed"           -0.70

    Positive Logits
    " âgé"               0.60
    " traditionnels"     0.57
    " متعلقه"            0.56
    " traditionnelle"    0.56
    " tradicionales"     0.54
    "MemoryWarning"      0.53
    " täh"               0.53
    " célèbres"          0.52
    " bicchiere"         0.50
    " miliardi"          0.50
    Activation Density: 0.034%

    No Known Activations