INDEX
    Explanations

    references to spare time or effort

    New Auto-Interp
    Negative Logits
    lapsingToolbar
    -0.66
    TagMode
    -0.63
    RenderAtEndOf
    -0.58
     burbujas
    -0.56
     verdades
    -0.56
     protoimpl
    -0.55
     reflexiones
    -0.54
     diensten
    -0.54
     plumas
    -0.52
     hembra
    -0.52
    POSITIVE LOGITS
     aff
    0.61
     Hans
    0.59
     bla
    0.58
     ff
    0.58
    Hans
    0.57
     Han
    0.55
     FF
    0.55
    aff
    0.53
     Evil
    0.53
     Aff
    0.52
    Act Density 0.201%

    No Known Activations