INDEX
    Explanations

    parentheses indicating numerical values with high importance

    parentheses or bracketed content

    New Auto-Interp
    Negative Logits
     Academy
    -0.71
     glare
    -0.71
     Lumpur
    -0.70
     zoo
    -0.68
     Lynd
    -0.68
     wildlife
    -0.66
     resur
    -0.66
     sav
    -0.66
     park
    -0.65
     lull
    -0.64
    POSITIVE LOGITS
    including
    1.65
    excluding
    1.54
    such
    1.51
    typically
    1.48
    which
    1.44
    usually
    1.42
    often
    1.39
    except
    1.38
    either
    1.38
    meaning
    1.37
    Act Density 0.153%

    No Known Activations