INDEX
    Explanations

    mentions of the name "Tom" in various contexts

    Negative Logits
    Token       Logit
    iaux        -0.18
    ABLE        -0.17
    able        -0.16
    anine       -0.16
    vet         -0.15
    unning      -0.15
    loi         -0.15
    uben        -0.15
    ifiable     -0.14
    AMP         -0.14
    Positive Logits
    Token       Logit
    Tom         0.25
    atoes       0.24
    tom         0.22
    orrow       0.20
    mas         0.20
    kins        0.19
    asso        0.19
    cat         0.19
    islav       0.19
    asz         0.18
    Activation Density: 0.018%

    No Known Activations