INDEX
Explanations
references to individuals or their actions related to authority or expertise
Follows possessive nouns
possessives like 's
New Auto-Interp
Negative Logits
المناصب
-0.76
للاسماء
-0.66
avoient
-0.64
yourselves
-0.62
SharedCtor
-0.61
pouvoit
-0.61
ThemeData
-0.61
végétale
-0.61
étoient
-0.59
dedans
-0.58
POSITIVE LOGITS
latest
0.85
newest
0.83
s
0.81
']").
0.79
%");
0.73
own
0.73
%}
0.72
his
0.70
"]);
0.69
entire
0.69
Activations Density 0.163%