INDEX
    Explanations

    words related to development or evolution

    instances of the word "dev" in various forms, particularly in relation to development or evolution

    New Auto-Interp
    Negative Logits
    berman
    -0.70
     Twain
    -0.70
    terday
    -0.69
    xual
    -0.68
     Sapp
    -0.66
     Rouge
    -0.66
     Moonlight
    -0.66
    SHIP
    -0.65
     Tempest
    -0.65
    ¥µ
    -0.64
    POSITIVE LOGITS
    olved
    1.46
    irtual
    1.43
    olution
    1.30
    iated
    1.30
    olve
    1.27
    iates
    1.24
    iant
    1.19
    iate
    1.19
    olving
    1.17
    iating
    1.16
    Act Density 0.026%

    No Known Activations