INDEX
    Negative Logits
    Token         Logit
    "Shar"        -0.08
    " narrow"     -0.07
    "\tNone"      -0.07
    " pray"       -0.07
    " Madrid"     -0.07
    " Lazar"      -0.07
    "gambar"      -0.07
    " spinner"    -0.07
    " Brown"      -0.07
    "reader"      -0.06
    Positive Logits
    Token         Logit
    " ect"        0.08
    "atto"        0.07
    "crm"         0.07
    "ักส"         0.06
    " lying"      0.06
    "empor"       0.06
    " bespoke"    0.06
    "cation"      0.06
    "<Entity"     0.06
    "FieldType"   0.06
    Activation Density: 0.001%

    No Known Activations