INDEX
    Explanations

    comparisons or statements involving similarity

    phrases that compare or liken something to another thing

    New Auto-Interp
    Negative Logits
    icious
    -0.76
    opian
    -0.68
     explor
    -0.67
    cffffcc
    -0.66
    isco
    -0.66
    iosyncr
    -0.66
    espie
    -0.66
    bid
    -0.65
    Returns
    -0.64
    endiary
    -0.64
    POSITIVE LOGITS
     usual
    0.82
     adults
    0.81
     normal
    0.77
     bandits
    0.76
     criminals
    0.76
     idiots
    0.73
     crazy
    0.72
     Ancients
    0.69
     Indians
    0.68
     Adults
    0.68
    Act Density 0.142%

    No Known Activations