INDEX
Explanations
specific verbs and actions in the text
New Auto-Interp
Negative Logits
rait
-0.17
Bilg
-0.17
غاÙĦ
-0.15
ConverterFactory
-0.15
.ra
-0.14
ycastle
-0.14
tep
-0.14
YRO
-0.14
Inherits
-0.14
bsite
-0.14
POSITIVE LOGITS
wid
0.17
æ¬
0.16
component
0.16
components
0.15
klad
0.15
Dawson
0.14
kup
0.14
imes
0.14
pas
0.14
DAO
0.14
Activations Density 0.003%