INDEX
Explanations
references to authorship or attribution in texts
New Auto-Interp
Negative Logits
pleaſure
-0.73
IsContent
-0.68
myſelf
-0.61
whoſe
-0.61
purpoſe
-0.60
itſelf
-0.60
ſche
-0.59
doméstica
-0.57
XmlAccessType
-0.57
houſe
-0.56
POSITIVE LOGITS
by
0.87
by
0.86
oleh
0.73
kasarigan
0.72
geslacht
0.68
By
0.65
By
0.65
izr
0.62
بواسطة
0.60
BY
0.59
Activations Density 0.171%