INDEX
    Explanations

    names of famous individuals

    names of notable individuals and references to performance or reputation

    New Auto-Interp
    Negative Logits
    obo
    -0.71
    istically
    -0.70
    ño
    -0.66
    iru
    -0.66
    ASH
    -0.65
    iso
    -0.64
    gdala
    -0.62
     Seg
    -0.60
    emia
    -0.60
    llah
    -0.60
    POSITIVE LOGITS
     Phelps
    0.95
     Manson
    0.89
    icka
    0.87
    mans
    0.82
    achusetts
    0.79
    mann
    0.79
    liga
    0.78
    ter
    0.77
    itudes
    0.77
    enium
    0.76
    Act Density 0.024%

    No Known Activations