INDEX
    Explanations

    phrases indicating requirements or conditions

    New Auto-Interp
    Negative Logits
    AxisAlignment
    -0.17
     konkrét
    -0.14
     Ages
    -0.14
    ele
    -0.14
    adil
    -0.14
    gado
    -0.14
    Spl
    -0.14
    unger
    -0.14
    arella
    -0.14
    far
    -0.14
    POSITIVE LOGITS
    yor
    0.17
    ponge
    0.14
    iasi
    0.14
    é¡į
    0.14
    airo
    0.14
    umn
    0.14
    itan
    0.13
     æĻ´
    0.13
    awe
    0.13
    igli
    0.13
    Act Density 0.036%

    No Known Activations