INDEX
    Explanations

    expressions of self-reflection and internal struggle

    New Auto-Interp
    Negative Logits
    //
    -0.64
    gameserver
    -0.59
     nargin
    -0.58
    Géographie
    -0.57
    GeneratedCode
    -0.56
    writeFieldEnd
    -0.55
    WillAppear
    -0.54
     springfox
    -0.54
    aarrggbb
    -0.52
    BeforeClass
    -0.51
    POSITIVE LOGITS
    RegistryLite
    0.62
     thinking
    0.59
     thinks
    0.56
     wondering
    0.52
    think
    0.51
     think
    0.50
     berpikir
    0.50
     wished
    0.49
     forgetting
    0.47
     thoughts
    0.47
    Act Density 0.277%

    No Known Activations