INDEX
    Explanations

    numerical values and their context within a monetary or quantity framework

    Negative Logits
    Token            Logit
    " dieß"          -0.82
    " itſelf"        -0.77
    " ainfi"         -0.74
    " dépens"        -0.73
    " nothwendig"    -0.72
    " leaſt"         -0.71
    " vœux"          -0.70
    " enfans"        -0.70
    " lèvres"        -0.68
    " étoient"       -0.68
    Positive Logits
    Token            Logit
    " or"            1.01
    " fucking"       0.85
    " damn"          0.83
    " million"       0.82
    " goddamn"       0.78
    " thousand"      0.77
    "something"      0.76
    " something"     0.75
    "fucking"        0.75
    " hundred"       0.73
    Activation Density: 0.294%

    No Known Activations