INDEX
    Explanations

    phrases related to intellectual or philosophical discussions and concepts

    complex ideas and concepts related to critical analysis and evaluation

    New Auto-Interp
    Negative Logits
    VL
    -0.65
     notor
    -0.62
    swick
    -0.60
    $.
    -0.59
    essage
    -0.58
    kef
    -0.58
    ãĥ¯ãĥ³
    -0.57
    flix
    -0.57
    Interstitial
    -0.56
    wikipedia
    -0.56
    POSITIVE LOGITS
     awaits
    0.69
    ¶
    0.68
     âĵĺ
    0.59
    "?
    0.59
     prompt
    0.56
     leaps
    0.55
     aside
    0.54
     Posted
    0.54
     huh
    0.53
     Dragonbound
    0.52
    Act Density 0.463%

    No Known Activations