INDEX
    Explanations

    constructs and references to recent findings or observations in academic or scientific contexts

    New Auto-Interp
    Negative Logits
     Infórmanos
    -1.13
     Signalez
    -0.90
     الحره
    -0.90
    帖最后由
    -0.90
     Administrativna
    -0.89
    Tikang
    -0.88
    ロウィン
    -0.86
     tartalomajánló
    -0.85
     nakalista
    -0.85
     Normdatei
    -0.85
    POSITIVE LOGITS
    1
    0.50
    3
    0.43
    6
    0.43
    0
    0.41
    4
    0.41
    0.41
    ↵↵
    0.41
    en
    0.39
    5
    0.39
    8
    0.39
    Act Density 7.614%

    No Known Activations