    Negative Logits
    Token            Logit
    `umb`            -0.49
    ` paz`           -0.48
    ` average`       -0.48
    `resource`       -0.48
    ` बारे`           -0.46
    `󠁢`               -0.46
    ` yön`           -0.46
    `ورت`            -0.46
    `fs`             -0.45
    `<0xEA>`         -0.45
    Positive Logits
    Token                Logit
    `__':\n`             1.00
    `__':`               0.95
    `__":\n`             0.95
    `__":`               0.94
    `BeginContext`       0.93
    ` Jefus`             0.87
    ` &___`              0.86
    `ValueStyle`         0.86
    `SequentialGroup`    0.81
    ` pleaſure`          0.80
    Activation Density: 0.141%

    No Known Activations