INDEX
    Explanations

    processes and occurrences

    Negative Logits
    Token          Logit
     찾는            0.41
    ეგ             0.39
    بان            0.38
    arke           0.38
     Greenpeace    0.38
     bulund        0.37
     boasted       0.37
     casually      0.36
     필요한           0.36
     boasting      0.36
    Positive Logits
    Token          Logit
     occuring      1.10
     occurring     1.09
    現象             1.08
     occurs        1.01
     occur         0.98
     phenomenon    0.96
    现象             0.96
     phenomena     0.91
     occured       0.90
    が発生            0.89
    Activation Density: 0.039%

    No Known Activations