INDEX
    Explanations

    quotes and references to increases in various contexts

    New Auto-Interp
    Negative Logits
     Dried
    -0.41
    |};
    -0.40
     fadeIn
    -0.40
    perative
    -0.38
     vivement
    -0.38
    bital
    -0.38
    madol
    -0.37
     gerekir
    -0.36
    同じく
    -0.36
    imply
    -0.36
    POSITIVE LOGITS
     increase
    0.90
     Increase
    0.68
    increase
    0.66
     decrease
    0.65
    Increase
    0.65
     שוליים
    0.58
    decrease
    0.56
    Decrease
    0.52
    quot
    0.52
     <<<<<<<<<<<<<<
    0.51
    Act Density 0.062%

    No Known Activations