INDEX
    Explanations

    names and relevant entities in various contexts

    New Auto-Interp
    Negative Logits
    iola
    -0.18
    324
    -0.17
    etter
    -0.15
    shore
    -0.14
    912
    -0.14
    ton
    -0.14
    eties
    -0.14
    throp
    -0.14
    lyn
    -0.14
    ges
    -0.14
    POSITIVE LOGITS
     upside
    0.28
    (turn
    0.22
     turned
    0.20
     loose
    0.20
    turn
    0.20
     Turn
    0.19
    into
    0.19
    .turn
    0.18
     turn
    0.18
    Turn
    0.18
    Act Density 0.024%

    No Known Activations