INDEX
    Explanations

    concepts related to measurement and evaluation in various contexts

    New Auto-Interp
    Head Attr Weights
    0:0.01
    1:0.01
    2:0.17
    3:0.10
    4:0.23
    5:0.03
    6:0.04
    7:0.14
    8:0.04
    9:0.04
    10:0.08
    11:0.05
    Negative Logits
    Congratulations
    -1.69
    eah
    -1.67
     Congratulations
    -1.64
    rats
    -1.59
     Flavoring
    -1.54
    yssey
    -1.52
    fell
    -1.51
    prus
    -1.50
    Joined
    -1.49
    -1.48
    POSITIVE LOGITS
     cruc
    1.44
     manageable
    1.39
     alloy
    1.39
     stroke
    1.36
     heel
    1.35
     サーティワン
    1.34
     practicable
    1.33
     appropriate
    1.33
     prescribed
    1.32
    ":["
    1.32
    Act Density 0.029%

    No Known Activations