INDEX
    Explanations

    occurrences of the word "of" in various contexts

    Head Attr Weights
    0:0.02
    1:0.01
    2:0.06
    3:0.09
    4:0.06
    5:0.04
    6:0.12
    7:0.20
    8:0.04
    9:0.04
    10:0.13
    11:0.15
    Negative Logits
    " ingred"         -1.82
    "ebus"            -1.77
    "nces"            -1.76
    "ivating"         -1.63
    "kered"           -1.61
    "ked"             -1.52
    " Caf"            -1.48
    " clut"           -1.47
    "paces"           -1.46
    "uts"             -1.46
    Positive Logits
    "Poly"             1.81
    " sin"             1.58
    "mol"              1.56
    "poly"             1.53
    " Brawl"           1.51
    " misdem"          1.49
    " miscarriage"     1.49
    " deflation"       1.40
    " Poly"            1.39
    " DV"              1.38
    Act Density 0.000%

    No Known Activations