INDEX
    Explanations

    specific verbs and actions in the text

    Negative Logits
    "rait"                -0.17
    " Bilg"               -0.17
    "غاÙĦ"                -0.15
    "ConverterFactory"    -0.15
    ".ra"                 -0.14
    "ycastle"             -0.14
    "tep"                 -0.14
    "YRO"                 -0.14
    " Inherits"           -0.14
    "bsite"               -0.14
    Positive Logits
    " wid"                0.17
    " æ¬"                 0.16
    " component"          0.16
    " components"         0.15
    "klad"                0.15
    " Dawson"             0.14
    "kup"                 0.14
    "imes"                0.14
    "pas"                 0.14
    "DAO"                 0.14
    Activation Density: 0.003%

    No Known Activations