INDEX
    Explanations

    the word "complete" followed by a number

    New Auto-Interp
    Negative Logits
    maid
    -0.91
    hao
    -0.78
    hops
    -0.75
    spr
    -0.71
    uay
    -0.69
    */(
    -0.66
     Zub
    -0.66
    adish
    -0.65
    anwhile
    -0.65
    UL
    -0.65
    POSITIVE LOGITS
     strangers
    1.01
    bred
    0.94
     stranger
    0.85
     immersion
    0.84
     disregard
    0.83
     lack
    0.83
     absence
    0.81
     beginners
    0.80
     fabrication
    0.79
     annihilation
    0.79
    Act Density 0.026%

    No Known Activations