INDEX
    Explanations

    citations and references in a scholarly context

    New Auto-Interp
    Negative Logits
     Cath
    -0.14
    iran
    -0.14
     (
    -0.14
     Tap
    -0.14
    inson
    -0.14
    è·Ŀ
    -0.14
    ashi
    -0.14
    orde
    -0.14
    unce
    -0.14
    atan
    -0.13
    POSITIVE LOGITS
     bekl
    0.18
    .opend
    0.17
    iest
    0.15
    arov
    0.15
    iated
    0.14
    iert
    0.14
    986
    0.14
    uppe
    0.14
    uze
    0.14
    ValueChanged
    0.14
    Act Density 0.005%

    No Known Activations