INDEX
    Explanations

    words related to the idea of creating or taking action towards improvement or achievement

    New Auto-Interp
    Negative Logits
    indir
    -0.15
    .Restrict
    -0.14
    anuts
    -0.14
    ientes
    -0.14
    odings
    -0.14
    icit
    -0.14
    AMI
    -0.14
    hangi
    -0.13
    inan
    -0.13
    ags
    -0.13
    POSITIVE LOGITS
     sure
    0.40
    sure
    0.31
     Sure
    0.30
    Sure
    0.27
     connections
    0.22
    SURE
    0.20
     dreams
    0.19
    Connections
    0.19
     Connections
    0.18
     waves
    0.18
    Act Density 0.076%

    No Known Activations