INDEX
    Explanations

    specific labels or identifiers related to figures and sections in scientific documents

    New Auto-Interp
    Negative Logits
     nahilalakip
    -1.06
    IUrlHelper
    -0.74
    thenReturn
    -0.69
     BoxDecoration
    -0.62
    ValueStyle
    -0.61
    estris
    -0.57
    ViewFeatures
    -0.57
     iconFacebook
    -0.57
     EconPapers
    -0.56
    merksam
    -0.55
    POSITIVE LOGITS
     forti
    0.47
    yes
    0.44
    pcm
    0.44
     Dac
    0.44
    SPATH
    0.43
    częściej
    0.43
    μφωνα
    0.43
    チュ
    0.42
     doPost
    0.42
     Lijst
    0.42
    Act Density 0.595%

    No Known Activations