INDEX
    Explanations

    phrases related to changes or fluctuations in various contexts

    increase or decrease of metrics

    New Auto-Interp
    Negative Logits
    </thead>
    -0.50
    -0.48
     queſta
    -0.46
     betweenstory
    -0.45
    ConstraintMaker
    -0.45
     ligiloj
    -0.44
    ether
    -0.44
     Administrativna
    -0.42
     mathématiques
    -0.42
    -0.42
    POSITIVE LOGITS
     decrease
    0.66
     Decrease
    0.65
    Decrease
    0.63
     increase
    0.59
     disminución
    0.59
     Increase
    0.58
    Increase
    0.56
    posedge
    0.56
     decline
    0.55
    decrease
    0.54
    Act Density 0.748%

    No Known Activations