INDEX
    Explanations

    terms related to concepts or abstract ideas

    New Auto-Interp
    Negative Logits
    GEBURTSDATUM
    -0.68
    PerformLayout
    -0.65
    Notae
    -0.65
    Jeografia
    -0.61
     firstly
    -0.60
    Revenir
    -0.59
     oprot
    -0.59
    انجليز
    -0.58
    featureID
    -0.58
    hens
    -0.58
    POSITIVE LOGITS
    Idea
    1.23
    idea
    1.15
     Ideas
    1.15
     Idea
    1.14
    Ideas
    1.09
    ideas
    1.06
     ideas
    0.93
     IDEA
    0.93
    Ide
    0.91
    IDEA
    0.91
    Act Density 0.119%

    No Known Activations