INDEX
    Explanations

    mentions of specific or definitive characteristics and details within a text

    New Auto-Interp
    Negative Logits
    <bos>
    -3.43
    /***
    
    -0.84
    ///**
    -0.74
    protected
    -0.67
    public
    -0.67
    -0.65
    
    
    -0.63
    //{
    
    -0.61
    /*!
    
    -0.61
     assistir
    -0.59
    POSITIVE LOGITS
     lidl
    1.30
     lele
    1.27
     milano
    1.25
     maroc
    1.24
     tramont
    1.24
     bandung
    1.23
     toledo
    1.21
     meis
    1.21
     wien
    1.21
     maneu
    1.20
    Act Density 0.074%

    No Known Activations