INDEX
    Explanations

    phrases related to consistency in various contexts

    New Auto-Interp
    Negative Logits
     Graff
    -0.76
     McCullough
    -0.73
     Breen
    -0.71
     vergüenza
    -0.69
    TemporalType
    -0.68
     Tapia
    -0.68
    razi
    -0.67
     Byers
    -0.67
     Zier
    -0.66
    pe
    -0.66
    POSITIVE LOGITS
     consistent
    1.96
     Consistency
    1.95
    consistent
    1.91
     consistency
    1.87
    Consistency
    1.81
     Consistent
    1.76
    consistency
    1.70
    Consistent
    1.68
     inconsistent
    1.60
     Consist
    1.53
    Act Density 0.108%

    No Known Activations