INDEX
    Explanations

    phrases that indicate consideration or assessment of factors and their impact

    New Auto-Interp
    Negative Logits
    ſelves
    -0.56
    ſelf
    -0.53
     ſch
    -0.49
     Eſ
    -0.47
     kasarigan
    -0.47
    PCP
    -0.46
    ContentLoaded
    -0.45
     againſt
    -0.44
     المعيارى
    -0.44
     feroit
    -0.43
    POSITIVE LOGITS
     reactions
    0.44
     Reactions
    0.44
     реакции
    0.42
    กรณ์
    0.40
     reaction
    0.40
    QUIN
    0.39
     реак
    0.39
     reacciones
    0.38
    reaction
    0.37
     Reak
    0.37
    Act Density 0.012%

    No Known Activations