INDEX
    Explanations

    phrases related to books or literature, and also words related to vehicles

    New Auto-Interp
    Negative Logits
     accla
    -1.46
     emphat
    -1.41
     madonna
    -1.41
     embra
    -1.41
     secon
    -1.40
     wien
    -1.39
     casio
    -1.38
     increa
    -1.37
     perfet
    -1.37
     vhs
    -1.36
    POSITIVE LOGITS
     tiny
    0.90
     small
    0.87
     Small
    0.82
    small
    0.82
    Small
    0.82
    tiny
    0.79
     kecil
    0.76
    和小
    0.74
    小的
    0.73
     little
    0.72
    Act Density 0.461%

    No Known Activations