INDEX
    Explanations

    words or sequences that include underscores or hyphens

    New Auto-Interp
    Negative Logits
     pleaſure
    -0.74
     itſelf
    -0.73
     purpoſe
    -0.72
     againſt
    -0.72
    ſelf
    -0.70
    ſelves
    -0.69
     ſche
    -0.69
     anſ
    -0.68
     auffi
    -0.66
     Anſ
    -0.66
    POSITIVE LOGITS
    LabelTagHelper
    0.69
     дописавши
    0.65
    RenderAtEndOf
    0.64
    uxxxx
    0.58
     ligiloj
    0.57
     >=",
    0.55
     AssemblyCulture
    0.53
     actionMode
    0.53
    unnitel
    0.51
    IZONTAL
    0.49
    Act Density 0.252%

    No Known Activations