INDEX
    Explanations

    temperature-related terms and their variations

    New Auto-Interp
    Negative Logits
    angel
    -0.16
    oit
    -0.16
    izar
    -0.15
    ToWorld
    -0.14
    oire
    -0.14
     umbrella
    -0.14
    dest
    -0.14
    engl
    -0.14
    uder
    -0.14
    .shtml
    -0.13
    POSITIVE LOGITS
    .sponge
    0.16
    oug
    0.15
     Jac
    0.15
    POSITE
    0.14
    awan
    0.14
    ako
    0.14
    EventListener
    0.14
    окол
    0.14
    ajor
    0.13
    afka
    0.13
    Act Density 0.005%

    No Known Activations