INDEX
Explanations
pronouns and related words denoting people or groups
references to specific individuals or pronouns indicating their actions
New Auto-Interp
Negative Logits
é¾įåĸļ士
-0.83
çͰ
-0.76
inence
-0.74
Sense
-0.74
20439
-0.73
Defin
-0.72
MENT
-0.70
mosqu
-0.70
cies
-0.69
ãĥĥãĥī
-0.68
POSITIVE LOGITS
Beckham
0.62
Ajax
0.62
rites
0.59
consent
0.59
/"
0.58
Kinect
0.58
Canaver
0.57
Couch
0.57
Franco
0.57
platform
0.57
Activations Density 0.000%