INDEX
    Explanations

    Column or technical definitions

    New Auto-Interp
    Negative Logits
     Prose
    0.43
    פס
    0.40
    тивного
    0.39
    0.38
     செயல்படுத்த
    0.38
    ستوى
    0.38
    दर्भ
    0.37
    aksanakan
    0.37
    actos
    0.37
     Sacred
    0.36
    POSITIVE LOGITS
     victories
    0.52
    0.50
     있으며
    0.46
     victory
    0.46
     refugees
    0.46
     douleurs
    0.45
     sympath
    0.45
     insurgents
    0.45
     distributor
    0.45
     melanch
    0.44
    Act Density 0.001%

    No Known Activations