INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     متعلقه
    -0.50
     oorlog
    -0.49
    avras
    -0.47
     MwSt
    -0.46
    kuuta
    -0.44
    âce
    -0.43
    romyalgia
    -0.43
     kepada
    -0.43
     terhadap
    -0.43
     للمعارف
    -0.42
    POSITIVE LOGITS
     the
    0.83
     conversations
    0.78
     discussions
    0.75
    IUrlHelper
    0.75
     ویکی‌پدی
    0.72
    ContentAsync
    0.70
     BoxDecoration
    0.70
     ComVisible
    0.69
     society
    0.68
     debates
    0.66
    Act Density 0.002%

    No Known Activations