INDEX
Explanations
phrases or constructions indicating possession or ownership
New Auto-Interp
Negative Logits
.gs
-0.17
suggestion
-0.15
ona
-0.15
âĻ
-0.15
Alexandria
-0.15
erti
-0.14
ypy
-0.14
ãĥ³ãĤ¹
-0.14
erah
-0.14
nox
-0.14
POSITIVE LOGITS
bilt
0.17
æĴ®
0.15
thane
0.15
åĥ
0.15
à¥ĭम
0.15
Brun
0.14
isson
0.14
riott
0.14
ieties
0.14
Yates
0.13
Activations Density 0.015%