INDEX
    Explanations

    Names of people

    New Auto-Interp
    Negative Logits
    >()
    -0.07
    ureau
    -0.07
    -war
    -0.06
    ...↵↵↵
    -0.06
     Deutsch
    -0.06
     advice
    -0.06
    ');?>
    -0.06
    	files
    -0.06
    -0.06
    ">↵↵↵
    -0.06
    POSITIVE LOGITS
    OME
    0.07
     Lucifer
    0.07
     segreg
    0.07
     Loves
    0.07
     Detroit
    0.07
    TexCoord
    0.06
    .S
    0.06
     TextField
    0.06
     limbs
    0.06
    oste
    0.06
    Act Density 0.002%

    No Known Activations