INDEX
    Explanations

    references to rankings or numerical positions

    punctuation marks, particularly periods

    Negative Logits
        " lag"              -0.64
        "etheless"          -0.60
        "staking"           -0.60
        "ãĥ¼ãĥĨ"            -0.59
        " confidentiality"  -0.55
        " luc"              -0.54
        " disturbed"        -0.54
        " enthus"           -0.54
        " pear"             -0.54
        " herds"            -0.52
    Positive Logits
        " 1"                0.91
        " 2"                0.84
        "1"                 0.81
        " 3"                0.80
        " 8"                0.77
        " 4"                0.76
        " 7"                0.76
        " 6"                0.75
        " 22"               0.75
        " 5"                0.73
    Act Density 0.024%

    No Known Activations