INDEX
    Explanations

    phrases with the word "big"

    repeated mentions of the word "big" in various contexts

    New Auto-Interp
    Negative Logits
     briefly
    -0.70
     temporarily
    -0.69
     respectively
    -0.66
     rendered
    -0.66
     restored
    -0.65
     med
    -0.62
     para
    -0.62
     momentarily
    -0.62
     subsequently
    -0.62
     notwithstanding
    -0.61
    POSITIVE LOGITS
    big
    3.89
    Big
    2.25
    largest
    1.78
    huge
    1.77
    small
    1.73
     BIG
    1.69
     big
    1.56
     Big
    1.49
     bigger
    1.42
    little
    1.36
    Act Density 0.009%

    No Known Activations