INDEX
    Explanations

    references to prominent individuals and their contributions in various fields

    New Auto-Interp
    Negative Logits
     Leading
    -0.17
    anny
    -0.15
    ipers
    -0.15
    Į¨
    -0.14
    Leading
    -0.14
    leading
    -0.14
    unik
    -0.14
    versions
    -0.14
    irected
    -0.14
     leading
    -0.13
    POSITIVE LOGITS
     best
    0.68
    best
    0.53
    -best
    0.45
     BEST
    0.41
    Best
    0.41
    (best
    0.41
     known
    0.41
     Best
    0.40
    _best
    0.40
     better
    0.39
    Act Density 0.088%

    No Known Activations