INDEX
    Explanations

    phrases related to providing evidence or support for a statement

    phrases related to supporting or validating statements or claims

    New Auto-Interp
    Negative Logits
    avery
    -0.86
    adel
    -0.75
    mouth
    -0.73
    kie
    -0.72
    erve
    -0.71
    ities
    -0.69
    acht
    -0.69
    obs
    -0.68
    eteenth
    -0.68
    isha
    -0.67
    POSITIVE LOGITS
    dates
    0.78
     shaky
    0.73
    rating
    0.72
    packs
    0.70
    dating
    0.70
    stretched
    0.69
     EVs
    0.68
     Rept
    0.67
    olicy
    0.65
     against
    0.64
    Act Density 0.020%

    No Known Activations