INDEX
Explanations
possessive pronouns and associated contexts
New Auto-Interp
Negative Logits
eil
-0.15
zon
-0.14
ible
-0.14
ç¨ĭ度
-0.14
æ»ħ
-0.13
ATABASE
-0.13
ç·ł
-0.13
ipel
-0.13
ockey
-0.13
ä¹
-0.13
POSITIVE LOGITS
element
0.25
appointed
0.22
Element
0.21
natural
0.19
stride
0.19
senses
0.18
Element
0.18
accustomed
0.18
true
0.18
glory
0.18
Activations Density 0.094%