INDEX
    Explanations

    simple and straightforward concepts

    instances of the word "simple"

    Negative Logits
    Token               Logit
    "âĹ¼"               -0.87
    " reckoned"         -0.78
    " largeDownload"    -0.73
    "raints"            -0.71
    " Experts"          -0.70
    "ITNESS"            -0.68
    "¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯"  -0.67
    "hips"              -0.67
    " Intellectual"     -0.63
    " Nanto"            -0.63

    Positive Logits
    Token               Logit
    "tons"              1.18
    " simple"           1.04
    "wallet"            0.95
    "simple"            0.94
    " straightforward"  0.94
    "minded"            0.92
    "json"              0.89
    "Simple"            0.86
    " syrup"            0.81
    " arithmetic"       0.78
    Activation Density: 0.017%

    No Known Activations