    Explanations

    pronouns and related words denoting people or groups

    references to specific individuals or pronouns indicating their actions

    Negative Logits
    é¾įåĸļ士    -0.83
    çͰ         -0.76
    inence      -0.74
    Sense       -0.74
    20439       -0.73
     Defin      -0.72
    MENT        -0.70
     mosqu      -0.70
    cies        -0.69
    ãĥĥãĥī      -0.68
    Positive Logits
     Beckham    0.62
     Ajax       0.62
    rites       0.59
     consent    0.59
    /"          0.58
     Kinect     0.58
     Canaver    0.57
     Couch      0.57
     Franco     0.57
    platform    0.57
    Activation Density: 0.000%

    No Known Activations