INDEX
    Explanations

    words related to horses and horse-related terminology

    New Auto-Interp
    Negative Logits
    REDACTED
    -0.77
    ERC
    -0.74
    Ô
    -0.71
    FactoryReloaded
    -0.70
     admitting
    -0.70
    ãģ®éŃĶ
    -0.68
    ħĭ
    -0.68
    piracy
    -0.63
    PsyNetMessage
    -0.62
    Favorite
    -0.62
    POSITIVE LOGITS
    izons
    1.70
    oscope
    1.09
    rid
    1.08
    osc
    1.02
    cru
    0.94
    seless
    0.91
    itas
    0.90
    izon
    0.88
    ror
    0.84
    gin
    0.79
    Act Density 0.002%

    No Known Activations