    Explanations

    numerical values and ranges

    Negative Logits
    Token           Logit
    "utenberg"      -0.69
    " Flavoring"    -0.68
    "ota"           -0.67
    "park"          -0.67
    "awar"          -0.65
    " Parish"       -0.65
    "aceous"        -0.64
    "Ĥª"            -0.62
    " Alic"         -0.62
    " Parker"       -0.60
    Positive Logits
    Token           Logit
    "31"            1.26
    "33"            1.15
    "34"            1.14
    "32"            1.13
    "35"            1.12
    " 31"           1.12
    "30"            1.11
    "36"            1.09
    "37"            1.08
    "38"            1.07
    Activation Density: 0.077%

    No Known Activations