INDEX
Explanations
references to ownership or possessive attributes related to people or entities
New Auto-Interp
Negative Logits
eil
-0.15
edis
-0.15
eum
-0.15
ornado
-0.14
avou
-0.14
.assertIs
-0.13
overs
-0.13
ed
-0.13
themselves
-0.13
ocaly
-0.13
POSITIVE LOGITS
own
0.27
iglia
0.19
own
0.17
próp
0.16
ups
0.16
sights
0.16
erre
0.16
roots
0.16
Own
0.16
Own
0.15
Activations Density 0.024%