INDEX
    Explanations

    words expressing emotions and actions related to creativity and personal experiences

    New Auto-Interp
    Negative Logits
    []
    
    -0.88
    '));
    
    -0.88
    )];
    
    -0.88
    ]='\
    -0.87
    ]),
    
    -0.85
     AssemblyTitle
    -0.84
    )}</
    -0.84
    ]');
    -0.83
    "]);
    
    -0.82
    ")));
    
    -0.82
    POSITIVE LOGITS
    .
    0.54
     and
    0.46
     sendiri
    0.45
    ρώ
    0.43
     in
    0.43
     with
    0.43
     edin
    0.43
     răm
    0.43
    هر
    0.42
     when
    0.42
    Act Density 0.230%

    No Known Activations