INDEX
    Explanations

    references to specific cultural or media elements

    New Auto-Interp
    Negative Logits
    ↵↵
    -0.82
    -0.79
     in
    -0.79
    -0.79
        
    -0.78
     and
    -0.78
     on
    -0.77
                
    -0.77
            
    -0.76
     
    -0.76
    POSITIVE LOGITS
     viciss
    1.86
     Sén
    1.83
     seksi
    1.81
     mef
    1.80
     Souha
    1.80
     panik
    1.73
     sappi
    1.72
     fta
    1.71
     fup
    1.69
     Bibl
    1.68
    Act Density 0.912%

    No Known Activations