INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     kin
    -0.62
     Tribunal
    -0.60
     wedd
    -0.59
     Levin
    -0.58
     faithfully
    -0.58
     Sapp
    -0.58
     pudding
    -0.57
     marrow
    -0.56
     MEP
    -0.56
     perce
    -0.56
    POSITIVE LOGITS
    #
    3.91
    ##
    2.15
     #
    1.98
    /#
    1.87
    .#
    1.85
    ####
    1.67
    =#
    1.62
    ###
    1.61
     "#
    1.54
     (#
    1.53
    Act Density 0.008%

    No Known Activations