INDEX
    Explanations

    phrases indicating separation or distinction

    New Auto-Interp
    Negative Logits
    dotenv
    -0.82
    orges
    -0.68
    SharedCtor
    -0.67
    type
    -0.65
    o
    -0.64
    styleType
    -0.62
    e
    -0.62
    volles
    -0.61
     type
    -0.61
    brahim
    -0.59
    POSITIVE LOGITS
     apart
    2.02
    apart
    1.86
     Apart
    1.67
     APART
    1.50
    Apart
    1.49
     aside
    1.43
    Aside
    1.32
    aside
    1.31
     Aside
    1.31
     appart
    1.13
    Act Density 0.077%

    No Known Activations