INDEX
    Explanations

    references to discussions about topics and questions for further analysis

    New Auto-Interp
    Negative Logits
     myſelf
    -0.54
     itſelf
    -0.51
    ſelf
    -0.49
     uſed
    -0.49
     ſch
    -0.48
     Keuangan
    -0.44
     faſt
    -0.43
     Ilmu
    -0.43
     circonst
    -0.42
    Jîn
    -0.42
    POSITIVE LOGITS
     discussed
    0.66
     discuss
    0.63
    discussed
    0.61
     discusses
    0.56
     Discuss
    0.56
    詳しくは
    0.55
     detailed
    0.52
    discuss
    0.52
     discus
    0.51
    OGND
    0.49
    Act Density 0.544%

    No Known Activations