INDEX
    Explanations

    phrases related to considerations and factors influencing decisions or contexts

    New Auto-Interp
    Negative Logits
    antan
    -0.15
    criptors
    -0.15
     therein
    -0.14
    ilt
    -0.14
    509
    -0.13
    fell
    -0.13
    ufe
    -0.12
    axter
    -0.12
    lararası
    -0.12
    utable
    -0.12
    POSITIVE LOGITS
     mind
    0.36
     upper
    0.32
     foremost
    0.30
    mind
    0.29
     firmly
    0.28
     forefront
    0.28
    upper
    0.28
     Mind
    0.27
     front
    0.25
     minds
    0.25
    Act Density 0.042%

    No Known Activations