INDEX
    Explanations

    phrases related to performing tasks or activities

    the word "the" in various contexts

    Negative Logits
    SPONSORED     -0.79
    GGGG          -0.71
    hower         -0.70
    ontent        -0.68
    ONSORED       -0.67
     "$:/         -0.63
    VERTISEMENT   -0.63
    neau          -0.63
     Provides     -0.62
    quished       -0.61
    Positive Logits
     unthinkable  1.27
     same         1.25
    same          1.12
     math         1.11
     opposite     1.05
     maths        1.00
     homework     0.97
     job          0.97
     trick        0.97
     groundwork   0.95
    Activation Density: 0.055%

    No Known Activations