INDEX
    Explanations

    mentions of theories or hypotheses

    New Auto-Interp
    Negative Logits
    tein
    -0.74
     Termin
    -0.66
    glers
    -0.66
     Diablo
    -0.66
     Crescent
    -0.66
     Delay
    -0.66
     DRAG
    -0.65
    onite
    -0.64
     OD
    -0.64
    ドラ
    -0.63
    POSITIVE LOGITS
     entertained
    0.77
     tending
    0.76
     favoring
    0.74
     pos
    0.73
     nour
    0.72
     promulg
    0.71
    leans
    0.71
     underpin
    0.70
    sembly
    0.69
    render
    0.69
    Act Density 0.108%

    No Known Activations