    Explanations

    Preferences and inclinations expressed through the word 'like' and its inflected forms (likes, liked, liking)

    Negative Logits
     Paglinawan
    -0.68
    脚注の使い方
    -0.56
    ScreenState
    -0.56
     intptr
    -0.56
    localctx
    -0.54
    asanjo
    -0.54
    ridgeshire
    -0.54
     Commands
    -0.54
    MethodManager
    -0.53
    -0.53
    Positive Logits
     likes
    2.55
     like
    2.34
    like
    2.34
     Like
    2.27
     LIKE
    2.25
    Like
    2.23
    likes
    2.23
    LIKE
    2.11
     liked
    2.11
     liking
    2.05
    Act Density 0.215%

    No Known Activations