INDEX
    Explanations

    references to numerical data or rankings

    words followed by specific conjunctions or next tokens

    Negative Logits
    Бахар              -0.62
     OnTriggerEnter    -0.52
     layui             -0.52
    ViewImports        -0.51
    AnimationsModule   -0.50
    providedIn         -0.47
     '\\;'             -0.46
    Населення          -0.44
     namanya           -0.44
    waitKey            -0.44

    Positive Logits
     kaarangay         0.38
    Jereo              0.38
     Italijanski       0.37
    ngu                0.36
    følge              0.35
     наве              0.35
     ivelany           0.35
    Хьажоргаш          0.34
    tagHelperRunner    0.34
    NOPQRST            0.33

    Activation Density: 0.095%

    No Known Activations