INDEX
    Explanations

    novelty and contribution

    New Auto-Interp
    Negative Logits
     धना
    0.44
     legendary
    0.42
     تاريخ
    0.40
     gigantes
    0.39
     অবিশ্বাস্য
    0.39
    劇情
    0.39
    0.38
    astrous
    0.38
     ঘৃণা
    0.38
    转换为
    0.37
    POSITIVE LOGITS
     novel
    1.50
     novelty
    1.50
    Novel
    1.43
    novel
    1.41
     Novel
    1.39
     originality
    1.16
     noved
    1.05
     novedad
    1.02
     methodological
    0.96
     contribution
    0.94
    Act Density 0.042%

    No Known Activations