INDEX
    Explanations

    themes related to emotional and psychological struggles

    New Auto-Interp
    Negative Logits
    '),
    
    -1.24
    "){
    
    -1.23
     ')
    
    -1.20
    '){
    
    -1.20
    '))
    
    -1.19
    ")){
    
    -1.18
    "):
    
    -1.18
    '):
    
    -1.17
     ")
    
    -1.16
    '],
    
    -1.10
    POSITIVE LOGITS
    .
    1.05
     because
    0.63
     while
    0.62
    ;
    0.61
    ,
    0.60
     regardless
    0.60
     when
    0.58
     instead
    0.58
     with
    0.56
     and
    0.54
    Act Density 13.296%

    No Known Activations