INDEX
    Explanations

    numerical data and references to figures or tables

    New Auto-Interp
    Negative Logits
    zcze
    -0.16
    ALA
    -0.15
    ourage
    -0.15
    à¥Ģà¤ķरण
    -0.15
    iferay
    -0.15
    .animations
    -0.14
    ahoma
    -0.13
    zier
    -0.13
    eyse
    -0.13
     Advocate
    -0.13
    POSITIVE LOGITS
    11
    0.25
    10
    0.24
    12
    0.23
    22
    0.20
    09
    0.20
    9
    0.19
    08
    0.18
    25
    0.18
    13
    0.18
    23
    0.17
    Act Density 0.047%

    No Known Activations