INDEX
    Explanations

    technical and detailed language used in documents or articles

    phrases related to effectiveness or performance of actions and processes

    New Auto-Interp
    Negative Logits
     Originally
    -0.50
     Reconstruction
    -0.49
    Drag
    -0.48
     Transparency
    -0.48
     Fract
    -0.46
     Latest
    -0.46
     Drag
    -0.45
     nutshell
    -0.45
     Torrent
    -0.45
     Collider
    -0.44
    POSITIVE LOGITS
    '."
    0.81
    .).
    0.75
    ]."
    0.75
    !".
    0.72
    atever
    0.70
     anyway
    0.70
    ).[
    0.69
    .'"
    0.67
    )."
    0.67
    '.
    0.67
    Act Density 3.917%

    No Known Activations