INDEX
    Explanations

    phrases with indefinite articles, particularly those that indicate a specific instance or scenario

    New Auto-Interp
    Negative Logits
    šet
    -0.17
    chet
    -0.16
    lü
    -0.16
    aż
    -0.15
    ISMATCH
    -0.15
    á»Ļn
    -0.15
    eking
    -0.15
    PLEX
    -0.14
    crete
    -0.14
    à¥įà¤Łà¤°
    -0.14
    POSITIVE LOGITS
     nutshell
    0.22
     manner
    0.21
     effort
    0.18
     hurry
    0.16
    306
    0.15
     
    0.15
    acer
    0.15
     span
    0.15
     Nut
    0.15
     rum
    0.14
    Act Density 0.137%

    No Known Activations