INDEX
    Explanations

    references to authors and their contributions in academic contexts

    New Auto-Interp
    Negative Logits
     itſelf
    -1.05
     Efq
    -0.96
     themſelves
    -0.92
     myſelf
    -0.90
     disambiguazione
    -0.89
     poffible
    -0.83
     Eſ
    -0.81
     pleaſure
    -0.81
     Majefty
    -0.81
     faſt
    -0.81
    POSITIVE LOGITS
     Von
    0.61
     Di
    0.60
    Di
    0.58
     De
    0.55
     Van
    0.54
    Von
    0.54
    pk
    0.52
    Van
    0.52
    mstyle
    0.51
     La
    0.51
    Act Density 0.224%

    No Known Activations