INDEX
    Explanations

    statements about personal experiences and reflections

    the phrase "what it's like" or similar expressions describing subjective experiences or perspectives.

    New Auto-Interp
    Negative Logits
     apapun
    -0.42
     anything
    -0.40
    anything
    -0.39
     Anything
    -0.36
    enapa
    -0.36
    miento
    -0.36
     égard
    -0.35
     qualquer
    -0.34
     Cualquier
    -0.34
     mengapa
    -0.34
    POSITIVE LOGITS
    -------------</
    0.58
     nakalista
    0.53
    
    0.52
    はこんな感じ
    0.52
     prawdzi
    0.52
    ecuted
    0.52
    Koordinaten
    0.52
     wahre
    0.50
     فريبيس
    0.50
     createState
    0.50
    Act Density 0.129%

    No Known Activations