INDEX
    Explanations

    abstract concepts and their impact

    New Auto-Interp
    Negative Logits
    attva
    0.84
     тех
    0.83
     ​​
    0.83
     receptionist
    0.78
     vosotros
    0.75
     suv
    0.75
     appellant
    0.75
     которыми
    0.74
     mane
    0.74
     society
    0.72
    POSITIVE LOGITS
    Variance
    1.22
     Variance
    1.17
     variance
    1.15
    variance
    1.12
    Entropy
    1.07
    Lind
    1.07
    1.06
    ٹن
    1.06
    டுகிறது
    1.06
    Ost
    1.05
    Act Density 0.166%

    No Known Activations