INDEX
    Explanations

    phrases related to actions directed at individuals and their consequences

    New Auto-Interp
    Negative Logits
    спе
    -0.45
    basicConfig
    -0.40
    @
    -0.39
    we
    -0.38
    ↵↵
    -0.38
    freep
    -0.38
     We
    -0.37
    تر
    -0.37
     Ver
    -0.36
    Sucesor
    -0.36
    POSITIVE LOGITS
    GEBURTSDATUM
    0.91
     universe
    0.91
     heaven
    0.90
     تضيفلها
    0.89
     Universe
    0.84
     earth
    0.80
     EconPapers
    0.80
     heavens
    0.79
    universe
    0.79
     moon
    0.78
    Act Density 0.273%

    No Known Activations