INDEX
    Explanations

    sentences that indicate research findings or conclusions

    New Auto-Interp
    Negative Logits
     ModelExpression
    -0.83
    IntoConstraints
    -0.78
     Efq
    -0.76
     Anſ
    -0.75
     protoimpl
    -0.75
    astéroïdes
    -0.74
     juſt
    -0.73
    saraba
    -0.73
     houſe
    -0.72
     Theſe
    -0.72
    POSITIVE LOGITS
    Keywords
    0.77
     Keywords
    0.69
    <eos>
    0.66
    abstractmethod
    0.65
    KEYWORDS
    0.58
     Abstract
    0.55
     rez
    0.55
    Abstract
    0.52
     INTRODUCTION
    0.50
     keywords
    0.50
    Act Density 0.554%

    No Known Activations