INDEX
    Explanations

    references to things being at the center of attention or focus

    phrases that indicate a central or main focus in a context

    New Auto-Interp
    Negative Logits
    ishable
    -0.76
    utan
    -0.64
    à©
    -0.64
    é¾įå
    -0.61
    )--
    -0.60
    hua
    -0.60
    syn
    -0.60
     à¨
    -0.60
    ratom
    -0.59
     chops
    -0.59
    POSITIVE LOGITS
    Initialized
    0.78
     gravity
    0.77
    igm
    0.70
    rency
    0.69
    inence
    0.69
    ierre
    0.67
    EVA
    0.67
    pole
    0.66
    gie
    0.65
    dyl
    0.63
    Act Density 0.067%

    No Known Activations