INDEX
    Explanations

    mentions of specific company names and authoritative figures giving instructions

    New Auto-Interp
    Negative Logits
    UREAU
    -0.69
    ungalow
    -0.60
     quæ
    -0.56
     Vikipedi
    -0.54
    etermined
    -0.53
    -0.53
     cæ
    -0.50
     caufe
    -0.47
     SOBRE
    -0.45
    ²(
    -0.45
    POSITIVE LOGITS
     done
    0.72
    done
    0.64
     DONE
    0.60
     Done
    0.57
    doing
    0.54
     doing
    0.54
     Doing
    0.53
     xdrive
    0.52
    Doing
    0.52
     chapeau
    0.51
    Act Density 0.239%

    No Known Activations