INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    The
    0.43
    Accuracy
    0.42
     liczba
    0.42
    ECTION
    0.41
    ruzione
    0.40
    0.40
     그러나
    0.39
    وكان
    0.39
    0.39
    CONF
    0.38
    POSITIVE LOGITS
     extensively
    0.63
     actively
    0.57
     적극
    0.57
    積極的に
    0.55
     proactively
    0.54
     consciously
    0.52
     активно
    0.52
     intentionally
    0.50
     প্রতিশ্রুতি
    0.47
     frequently
    0.46
    Act Density 0.012%

    No Known Activations