    Explanations

    instances of the word "review" and its variations

    Negative Logits
    Token               Logit
    " itſelf"           -0.90
    "RenderAtEndOf"     -0.87
    "MLLoader"          -0.85
    "makeConstraints"   -0.82
    " RSITY"            -0.81
    "]--;"              -0.81
    "VIRONMENT"         -0.80
    "ICAGO"             -0.80
    "leſs"              -0.79
    "twimg"             -0.78
    Positive Logits
    Token               Logit
    " review"            1.03
    " Review"            0.95
    "review"             0.87
    " reviews"           0.87
    "Review"             0.76
    " reviewers"         0.75
    " REVIEW"            0.75
    " reviewed"          0.74
    "properties"         0.74
    " comments"          0.72
    Activation Density: 0.105%

    No Known Activations