INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.59
     Masyarakat
    -0.56
    FirstResponder
    -0.55
    atario
    -0.55
    mappings
    -0.54
    -0.54
     Palme
    -0.53
     faveur
    -0.53
     Chant
    -0.51
     Jü
    -0.51
    POSITIVE LOGITS
     metal
    0.73
     metals
    0.69
     Metal
    0.69
    metal
    0.67
    metals
    0.66
    Metals
    0.65
     foil
    0.65
    ensement
    0.63
    worker
    0.63
    úrg
    0.63
    Act Density 0.119%

    No Known Activations