INDEX
    Explanations

    positive adjectives and attributes

    descriptors of quality and popularity

    New Auto-Interp
    Negative Logits
    helm
    -0.86
    vier
    -0.82
    orthy
    -0.80
    reasonable
    -0.79
    SPONSORED
    -0.79
    similar
    -0.77
    utics
    -0.76
    biased
    -0.76
    illance
    -0.75
    serious
    -0.74
    POSITIVE LOGITS
     USS
    0.93
     confines
    0.88
     Crimson
    0.84
    liest
    0.81
     Millennium
    0.79
     Dakota
    0.79
     Falk
    0.77
     Trinity
    0.77
     Notting
    0.77
     Golden
    0.76
    Act Density 0.264%

    No Known Activations