Explanations
possessive forms indicating ownership or association
Negative Logits
oeff            -0.18
ルク            -0.16
herits          -0.16
anna            -0.15
colleagues      -0.15
teammates       -0.15
utow            -0.14
wife            -0.14
cela            -0.14
predecessors    -0.14
Positive Logits
version         0.21
own             0.20
premier         0.20
answer          0.20
leading         0.19
most            0.19
ayne            0.19
newest          0.18
finest          0.18
Answer          0.17
Activations Density 0.139%