INDEX
    Explanations

    phrases related to training and education

    Negative Logits
    Token          Logit
    "_training"    -0.32
    " trained"     -0.31
    " training"    -0.31
    "_train"       -0.31
    " Training"    -0.28
    "_TRAIN"       -0.28
    "Training"     -0.28
    "training"     -0.25
    "trained"      -0.23
    "训"           -0.23
    Positive Logits
    Token          Logit
    "ees"          0.30
    "ee"           0.30
    " wheels"      0.21
    " Wheels"      0.20
    "ning"         0.20
    "ings"         0.19
    "ining"        0.18
    "/testing"     0.17
    " session"     0.16
    " sessions"    0.16
    Activation Density: 0.037%

    No Known Activations