INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    9
    0.74
    8
    0.69
    4
    0.68
    6
    0.68
    7
    0.63
    5
    0.58
     
    0.57
    including
    0.55
    0
    0.54
    armament
    0.51
    POSITIVE LOGITS
    0.57
     voda
    0.56
     tubig
    0.54
     вода
    0.51
     деньги
    0.50
    0.50
    本来
    0.49
    o
    0.49
     വസ്തു
    0.49
     uang
    0.47
    Act Density 0.007%

    No Known Activations