INDEX
    Explanations

    Emphasis and important points

    Negative Logits
    "一方面"            0.46
    " efficiently"     0.42
    " useful"          0.42
    " interesting"     0.42
    " intéressante"    0.41
    " valuable"        0.41
    "便利な"            0.41
    " intéressant"     0.40
    " Interesting"     0.39
                       0.39
    Positive Logits
    "Seriously"        1.77
    " seriously"       1.76
    " Seriously"       1.73
    "seriously"        1.70
    " literally"       1.16
    " absolutely"      1.13
    "absolutely"       1.09
    " Literally"       1.09
    " Absolutely"      1.05
    " absolutamente"   1.05
    Activation Density: 0.070%

    No Known Activations