INDEX
    Explanations

    phrases indicating a level of significance or importance regarding various subjects

    New Auto-Interp
    Negative Logits
    hazi
    -0.17
    amel
    -0.14
    INGS
    -0.14
     basically
    -0.14
    ÑģÑıÑĤ
    -0.14
    æŁ»
    -0.14
    amburg
    -0.14
    .ibatis
    -0.13
    زاÙĨ
    -0.13
    nox
    -0.13
    POSITIVE LOGITS
     anymore
    0.32
     necessarily
    0.29
     nor
    0.25
    nor
    0.18
    ë§Ŀ
    0.16
    çĽ
    0.15
    Ìģc
    0.15
     Nor
    0.15
    eks
    0.15
    ainter
    0.15
    Act Density 0.036%

    No Known Activations