INDEX
    Explanations

    Version incompatibility

    New Auto-Interp
    Negative Logits
     altru
    -0.08
     Steiner
    -0.08
    ですね
    -0.07
    яло
    -0.07
     слово
    -0.07
     стане
    -0.07
     ясно
    -0.07
     niente
    -0.07
     mosque
    -0.07
     aloud
    -0.07
    POSITIVE LOGITS
    Compatibility
    0.15
     compatibility
    0.15
     incompatible
    0.15
    compat
    0.15
     Compatibility
    0.15
    版本
    0.15
     kompat
    0.14
     incompat
    0.14
     compatible
    0.14
    compatible
    0.14
    Act Density 0.023%

    No Known Activations