INDEX
    Explanations

    references to modes of transportation, specifically trains

    New Auto-Interp
    Negative Logits
    osi
    -0.70
     Tablet
    -0.69
     Sphere
    -0.68
    erenn
    -0.67
     Reach
    -0.62
     pop
    -0.62
     Prairie
    -0.60
    cised
    -0.60
     alien
    -0.58
    metics
    -0.58
    POSITIVE LOGITS
    wreck
    1.28
    roads
    1.02
    loads
    1.00
     conductor
    1.00
    ees
    0.97
    cars
    0.92
     wreck
    0.89
     cars
    0.88
     locom
    0.85
     trains
    0.85
    Act Density 0.033%

    No Known Activations