Explanations
possessive forms or references to ownership
Negative Logits
agi            -0.16
Choice         -0.15
choice         -0.15
Squ            -0.15
ones           -0.15
eline          -0.14
corporation    -0.14
allet          -0.14
choice         -0.14
company        -0.13
Positive Logits
contents       0.21
wner           0.19
contents       0.19
esser          0.18
inhabitants    0.17
VIC            0.15
ogui           0.15
impact         0.15
ilk            0.15
æŀľ            0.15
Activations Density 0.234%
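
The positive-logit entry æŀľ looks like encoding debris, but it may simply be a token shown in a byte-level BPE display alphabet. The sketch below assumes (the page does not say so) that the tokens come from a GPT-2 style byte-level tokenizer, and rebuilds that byte-to-character table to map such a string back to readable text. The helper names are hypothetical and not part of any dashboard API.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2 style byte -> display-character table (assumed scheme)."""
    # Printable bytes are displayed as themselves.
    printable = list(range(33, 127)) + list(range(161, 173)) + list(range(174, 256))
    mapping = {b: chr(b) for b in printable}
    # Every remaining byte gets the next unused code point starting at U+0100.
    extra = 0
    for b in range(256):
        if b not in mapping:
            mapping[b] = chr(256 + extra)
            extra += 1
    return mapping


# Inverse table: display character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_byte_level_token(token: str) -> str:
    """Map a displayed token string back to the UTF-8 text it encodes."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    # Partial multi-byte tokens are not valid UTF-8 on their own, so replace errors.
    return raw.decode("utf-8", errors="replace")


print(decode_byte_level_token("æŀľ"))  # 果 (U+679C), if the assumption holds
```

Plain ASCII tokens such as wner or contents pass through unchanged, so the helper can be applied to every row of the lists above.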