INDEX
    Explanations

    terms related to academic and innovative contexts

    Negative Logits
    .datab      -0.17
    anoia       -0.17
    .typ        -0.17
    abant       -0.17
    AutoSize    -0.17
    ży          -0.15
    âng         -0.15
     pers       -0.15
    itter       -0.15
    ýv          -0.15
    Positive Logits
    ullen       0.18
    ÙħÛĮÙĦ      0.15
    Łèĥ½        0.14
     aure       0.14
    aura        0.13
    UIL         0.13
    ajaran      0.13
    eru         0.13
    oren        0.13
    ãĤ¹ãĥ¬      0.13
    Activation Density: 0.012%

    No Known Activations