INDEX
    Negative Logits
    Token           Logit
    " Hoover"       -0.74
    "WOOD"          -0.73
    " Meadow"       -0.70
    " ECO"          -0.68
    " Ames"         -0.67
    " Weir"         -0.66
    " clearance"    -0.66
    " Wast"         -0.66
    " Kurd"         -0.65
    " Wem"          -0.64
    Positive Logits
    Token           Logit
    "piracy"        1.57
    "ervatives"     1.52
    "umers"         1.46
    "ensus"         1.40
    "istent"        1.35
    "cientious"     1.35
    "ensual"        1.35
    "olid"          1.32
    "idered"        1.31
    "erv"           1.30
    Activation Density: 0.005%

    No Known Activations