INDEX
    Explanations

    mentions of specific individuals

    the presence of the word "ve" in various contexts

    New Auto-Interp
    Negative Logits
    £ı
    -0.81
    olicy
    -0.74
     GOODMAN
    -0.73
    SpaceEngineers
    -0.65
    artifacts
    -0.65
    matically
    -0.65
     administ
    -0.64
    assian
    -0.64
     resize
    -0.63
    wcs
    -0.62
    POSITIVE LOGITS
    illance
    1.19
    mber
    1.15
    ttes
    1.09
    rette
    1.07
    llers
    1.05
    ller
    1.03
    ggie
    1.03
    lla
    1.00
    tt
    0.99
    tta
    0.93
    Act Density 0.038%

    No Known Activations