    Explanations

    directory and file-related terms

    Negative Logits
     Efq          -1.33
     pleaſure     -1.23
     Shakspeare   -1.23
     Мексичка     -1.21
     Jefus        -1.19
    __":          -1.18
     Monfieur     -1.17
     Theſe        -1.17
     estekak      -1.17
     Anſ          -1.16
    Positive Logits
     Bru          0.94
    bru           0.74
     bru          0.73
     (            0.73
    ,             0.69
                  0.69
    Bru           0.69
     into         0.67
    dir           0.67
     m            0.66
    Activation Density: 0.192%

    No Known Activations