INDEX
    Explanations

    references to sudden increases or spikes in various contexts

    New Auto-Interp
    Negative Logits
    IRA
    -0.15
    orraine
    -0.15
     persistent
    -0.14
    گاÙĨÛĮ
    -0.14
    urre
    -0.14
    haul
    -0.14
     Whites
    -0.14
    ORIZONTAL
    -0.14
    imers
    -0.14
    anza
    -0.14
    POSITIVE LOGITS
    fen
    0.16
    etzt
    0.16
    y
    0.15
    felt
    0.15
    atal
    0.14
    SENT
    0.14
    oppel
    0.14
     Sommer
    0.13
     Economy
    0.13
    stry
    0.13
    Act Density 0.005%

    No Known Activations