    Explanations

    phrases related to existential concerns and interactions

    Negative Logits
    "Personendaten"       -0.69
    " Baillargeon"        -0.68
    "发表于"              -0.67
    " للاسماء"            -0.64
    " AssemblyCulture"    -0.59
    " Réponses"           -0.59
    " oblige"             -0.58
    "Становништво"        -0.55
    "=$?"                 -0.55
    "nloa"                -0.55
    Positive Logits
    " thing"              0.69
    " things"             0.63
    "ような"              0.61
                          0.52
    "škas"                0.52
    " 것"                 0.51
    "のも"                0.51
    "things"              0.50
    "のは"                0.49
    " 것이"               0.49
    Activation Density: 0.036%

    No Known Activations