INDEX
    Explanations

    phrases indicating importance or significance

    phrases that indicate singular identification or significance

    New Auto-Interp
    Negative Logits
    ooks
    -0.66
     cores
    -0.64
    ãĤµ
    -0.62
     Palestin
    -0.60
     fences
    -0.59
    inas
    -0.58
    older
    -0.58
     warranties
    -0.58
     cliffs
    -0.58
    inity
    -0.58
    POSITIVE LOGITS
     Hundred
    0.91
     hundred
    0.89
     dimensional
    0.86
    rency
    0.81
     Thousand
    0.75
     thing
    0.74
     Piece
    0.73
    anchester
    0.73
    eree
    0.72
     thousand
    0.72
    Act Density 0.149%

    No Known Activations