INDEX
    Explanations

    phrases related to providing explanations or defining concepts

    the word "where" in various contexts, indicating locations or settings

    New Auto-Interp
    Negative Logits
    ³³³³³³³³
    -0.67
    ³³³
    -0.63
    rieve
    -0.63
     Dance
    -0.61
    rolet
    -0.60
     Epidem
    -0.60
     Rite
    -0.58
    ³³³³³³³³³³³³³³³³
    -0.58
     Doctrine
    -0.58
    ME
    -0.58
    POSITIVE LOGITS
    upon
    1.57
    soever
    1.27
    abouts
    1.05
    fore
    1.04
    owler
    0.74
    ever
    0.73
    ipl
    0.73
    ver
    0.72
    anooga
    0.72
    players
    0.71
    Act Density 0.067%

    No Known Activations