INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     bot
    -0.69
    Binding
    -0.67
    DeleteBehavior
    -0.63
     operator
    -0.63
     estekak
    -0.61
     binding
    -0.61
    tagHelperRunner
    -0.60
    Datuak
    -0.60
     Bake
    -0.60
    binding
    -0.60
    POSITIVE LOGITS
    Ext
    0.95
    ext
    0.93
     Ext
    0.83
     citoy
    0.67
     CIT
    0.62
     Diony
    0.61
     violenza
    0.60
    intios
    0.60
     huk
    0.58
     biens
    0.58
    Act Density 0.038%

    No Known Activations