INDEX
    Explanations

    phrases that indicate the presence and status of various conditions or issues

    "are" followed by an adjective

    New Auto-Interp
    Negative Logits
     its
    -1.42
     Its
    -1.15
    Its
    -1.13
     itself
    -1.06
    its
    -1.01
    它的
    -0.95
    -0.91
    itself
    -0.78
     it
    -0.74
     которое
    -0.74
    POSITIVE LOGITS
     themselves
    1.67
    themselves
    1.55
     themſelves
    1.37
    those
    1.16
     serem
    1.14
     ones
    1.12
     those
    1.10
     amelyek
    1.08
     которые
    1.05
    Those
    1.05
    Act Density 1.068%

    No Known Activations