INDEX
    Explanations

    intensifiers or modifiers that emphasize the degree of a particular quality or state

    New Auto-Interp
    Negative Logits
    참고
    -0.81
     betweenstory
    -0.77
     متعلقه
    -0.71
    GEBURTSDATUM
    -0.68
    λίου
    -0.66
     saites
    -0.61
    Біографія
    -0.60
    afficheront
    -0.59
     SwiftUI
    -0.59
    -0.59
    POSITIVE LOGITS
     many
    0.65
     fewer
    0.59
     Sebab
    0.57
     more
    0.55
     dozens
    0.54
     lots
    0.52
     everything
    0.52
     one
    0.52
     nearly
    0.50
     loads
    0.49
    Act Density 0.304%

    No Known Activations