INDEX
    Explanations

    sentences discussing personal experiences or reflections on societal changes

    New Auto-Interp
    Negative Logits
    AlterField
    -0.49
    까지
    -0.47
    bufio
    -0.46
    +"_
    -0.46
    Personendaten
    -0.43
    InstrumentedTest
    -0.42
    RectangleBorder
    -0.42
     quoi
    -0.41
    وشن
    -0.39
    ißt
    -0.39
    POSITIVE LOGITS
     now
    3.04
    now
    2.71
    今では
    2.38
    Now
    2.33
     Now
    2.32
     maintenant
    2.29
     ahora
    2.26
     теперь
    2.25
     sekarang
    2.23
     teraz
    2.13
    Act Density 1.538%

    No Known Activations