INDEX
    Explanations

    occurrences of the word "train."

    occurrences of the word "train" and its variations

    New Auto-Interp
    Negative Logits
    erenn
    -0.71
    osi
    -0.70
     Tablet
    -0.66
     Sphere
    -0.65
     Prairie
    -0.65
     Reach
    -0.63
    cised
    -0.62
     pop
    -0.62
    theless
    -0.62
    pring
    -0.61
    POSITIVE LOGITS
    wreck
    1.26
    ees
    1.03
    roads
    0.98
     wreck
    0.97
     conductor
    0.97
    loads
    0.94
     passenger
    0.90
     derail
    0.88
    liner
    0.86
    ee
    0.86
    Act Density 0.035%

    No Known Activations