INDEX
    Explanations

    introductory phrases or transitions

    Negative Logits
    Token             Logit
    "cimientos"       -0.52
    " szint"          -0.52
    "enschappen"      -0.51
    " Al"             -0.50
    " információ"     -0.50
    "ровок"           -0.49
    "脚注の使い方"    -0.47
    " we"             -0.47
    "-"               -0.47
    " azon"           -0.46
    Positive Logits
    Token             Logit
    " contextLoads"   0.74
    " Efq"            0.71
    " otomatig"       0.66
    "titleMargin"     0.65
    " Theſe"          0.65
    " Proced"         0.62
    "ſelf"            0.61
    " ARXIV"          0.61
    " Perſ"           0.60
    "ſelves"          0.60
    Activation Density: 0.025%

    No Known Activations