INDEX
    Explanations

    numbers and technical terms

    references to numeric data and measurements

    New Auto-Interp
    Negative Logits
    esides
    -0.60
    ,)
    -0.57
    76561
    -0.56
     resil
    -0.53
    ),"
    -0.52
    .),
    -0.50
    ="/
    -0.50
    "),
    -0.50
    .}
    -0.49
    portation
    -0.48
    POSITIVE LOGITS
     (
    1.92
     (?,
    1.54
     (.
    1.49
     ([
    1.49
     (-
    1.47
     (_
    1.46
     (!
    1.45
     (/
    1.44
     ('
    1.42
     ((
    1.40
    Act Density 0.315%

    No Known Activations