INDEX
    Explanations

    phrases that indicate relationships or comparisons between different entities

    New Auto-Interp
    Negative Logits
     is
    -0.80
     a
    -0.65
     an
    -0.60
     sheet
    -0.59
     was
    -0.58
     en
    -0.57
    ase
    -0.56
     sur
    -0.56
    minus
    -0.56
    一种
    -0.55
    POSITIVE LOGITS
    ſelves
    0.85
     ISNI
    0.84
     '\\;'
    0.81
     leaſt
    0.78
     ainfi
    0.78
     Akismet
    0.77
     tetrach
    0.76
     MainAxisSize
    0.76
     jotka
    0.75
    ValueStyle
    0.75
    Act Density 0.272%

    No Known Activations