INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    <bos>
    -0.77
     is
    -0.76
      
    -0.72
    <eos>
    -0.71
     T
    -0.70
    R
    -0.69
    K
    -0.68
     R
    -0.67
    -0.67
    H
    -0.66
    POSITIVE LOGITS
     Monfieur
    1.09
     Houſe
    0.98
     ་་
    0.94
     myſelf
    0.94
     Efq
    0.92
     houſe
    0.92
     itſelf
    0.90
     Reſ
    0.89
     InputDecoration
    0.88
     ſever
    0.87
    Act Density 1.344%

    No Known Activations