INDEX
    Explanations

    references to specific individuals, particularly the name "Dana" and its variants in various contexts

    mentions of the name "Dana" in various contexts

    New Auto-Interp
    Negative Logits
    */(
    -0.74
    tered
    -0.69
    eering
    -0.67
     invari
    -0.67
    inent
    -0.66
    blem
    -0.66
    asking
    -0.65
    GoldMagikarp
    -0.63
    ailable
    -0.63
    itudinal
    -0.63
    POSITIVE LOGITS
    iesel
    0.82
     Scully
    0.81
     Vin
    0.80
    eger
    0.76
     Brooke
    0.75
    uala
    0.75
    her
    0.74
    uin
    0.73
    ei
    0.72
    illus
    0.72
    Act Density 0.049%

    No Known Activations