INDEX
    Explanations

    numerical values and measurements that imply size and capacity

    Negative Logits
    +#+#              -0.64
    enschappelijke    -0.61
    ArgsConstructor   -0.61
     مرئيه            -0.60
     spokes           -0.59
    unmodifiable      -0.58
     kasarigan        -0.57
    Suara             -0.55
    ruppen            -0.55
     ralla            -0.54
    Positive Logits
    DeleteBehavior    0.53
     average          0.50
     dalamnya         0.49
     completos        0.49
    hilangan          0.47
    Gemeinden         0.47
     disparu          0.47
     équi             0.47
     normalt          0.47
    单个              0.46
    Act Density 0.301%

    No Known Activations