INDEX
    Explanations

    numerical references and citations within academic articles

    New Auto-Interp
    Negative Logits
     
    -0.17
     Laur
    -0.17
     rent
    -0.16
     Gaut
    -0.15
     Ret
    -0.15
    390
    -0.15
     Perc
    -0.15
     rents
    -0.15
    rent
    -0.15
     rentals
    -0.15
    POSITIVE LOGITS
    onis
    0.15
    Official
    0.15
    æ»
    0.15
    emoc
    0.15
     stripslashes
    0.14
    erras
    0.14
    åª
    0.14
    Creators
    0.14
    .pref
    0.14
    emas
    0.14
    Act Density 0.009%

    No Known Activations