INDEX
    Explanations

    phrases referring to a specific position or state

    phrases indicating various states of being or circumstances

    Negative Logits
    " Flavoring"    -0.96
    "favorite"      -0.68
    "origin"        -0.66
    " Dresden"      -0.63
    " Medal"        -0.63
    "âĵĺ"           -0.61
    " antid"        -0.61
    "avorite"       -0.60
    "glomer"        -0.60
    " Brist"        -0.59
    Positive Logits
    "WARE"          0.72
    "Ĥİ"            0.72
    "achy"          0.70
    " haste"        0.69
    "ossession"     0.68
    " toile"        0.65
    "nir"           0.64
    "docker"        0.63
    " hurry"        0.63
    "phabet"        0.61
    Act Density 0.044%

    No Known Activations