INDEX
    Explanations

    individual nouns or noun phrases that contain the word "of"

    New Auto-Interp
    Negative Logits
     partName
    -0.76
     CBI
    -0.62
     ss
    -0.61
     cf
    -0.61
     prices
    -0.61
     barg
    -0.60
     passers
    -0.60
     galleries
    -0.59
     scores
    -0.59
     appra
    -0.59
    POSITIVE LOGITS
    rontal
    1.33
    sky
    1.00
    uture
    0.96
    eatures
    0.95
    ortunately
    0.93
    unction
    0.92
    icial
    0.92
    rost
    0.88
    rame
    0.87
     course
    0.87
    Act Density 0.027%

    No Known Activations