INDEX
    Explanations

    instances of the word "this"

    New Auto-Interp
    Negative Logits
    tones
    -0.88
    oran
    -0.79
    acers
    -0.78
    fed
    -0.77
    bats
    -0.74
    rics
    -0.70
    aughtered
    -0.70
     Roads
    -0.69
    iak
    -0.68
    anches
    -0.67
    POSITIVE LOGITS
     phenomenon
    1.07
     trope
    1.05
     discrepancy
    1.04
     particular
    0.95
     anomaly
    0.90
     happening
    0.90
     happen
    0.88
     limitation
    0.87
     predicament
    0.87
     pecul
    0.86
    Act Density 0.074%

    No Known Activations