INDEX
    Explanations

    statements indicating that more information can be found elsewhere

    references to completeness or entirety in information

    New Auto-Interp
    Negative Logits
    onis
    -0.69
    fu
    -0.68
    rosso
    -0.68
    _-
    -0.66
     Fever
    -0.65
    zy
    -0.64
    fw
    -0.64
    yne
    -0.61
    wo
    -0.61
    Ear
    -0.60
    POSITIVE LOGITS
     brunt
    0.93
     extent
    0.83
     underside
    0.80
     impetus
    0.78
     embodiment
    0.78
     alphabet
    0.78
     wording
    0.75
     nature
    0.74
     spectrum
    0.73
     fullest
    0.71
    Act Density 0.099%

    No Known Activations