INDEX
    Explanations

    references to famous books, fictional characters, and political figures

    Negative Logits
    Token            Logit
    "*/("            -0.83
    "eros"           -0.82
    "uably"          -0.81
    "iasco"          -0.76
    " Helpful"       -0.71
    "fficient"       -0.70
    "ebin"           -0.70
    "USD"            -0.68
    "arbon"          -0.67
    "ctrl"           -0.67

    Positive Logits
    Token            Logit
    " Admir"          0.77
    "cliffe"          0.76
    " Sovereign"      0.74
    " Rothschild"     0.74
    " Prayer"         0.73
    " Duchess"        0.68
    "Card"            0.67
    " Majesty"        0.67
    "assies"          0.67
    " Lann"           0.66

    Activation Density: 16.550%

    No Known Activations