INDEX
    Explanations

    words related to trains and training

    references to the word "train" in various contexts

    Negative Logits
    "uid"           -0.75
    "hed"           -0.74
    "hedral"        -0.69
    "hern"          -0.67
    "hedon"         -0.66
    " Fernandez"    -0.66
    " Gawker"       -0.65
    "cens"          -0.63
    " Wid"          -0.62
    "arag"          -0.62

    Positive Logits
    " train"        3.82
    " trains"       2.87
    " Train"        2.81
    "train"         2.70
    "Train"         2.55
    " Amtrak"       1.62
    " railway"      1.60
    " training"     1.55
    " trained"      1.54
    " railroad"     1.51

    Act Density 0.013%

    No Known Activations