INDEX
    Explanations

    names of people, sometimes with surrounding context like titles or initials

    Negative Logits
     lume           -0.50
    ueba            -0.50
     marche         -0.48
    vedra           -0.48
     Soho           -0.47
     bross          -0.47
     Pry            -0.47
    zám             -0.46
    barang          -0.46
    kmale           -0.46
    Positive Logits
    MemoryWarning   0.75
     يتيمه          0.72
     MainAxisSize   0.71
    DoubleQuotes    0.68
    niająca         0.66
                    0.63
    Personendaten   0.63
    Tembelea        0.62
    HasForeignKey   0.61
    AxisAlignment   0.60
    Activation Density: 0.591%

    No Known Activations