INDEX
    Explanations

    understanding relationships

    New Auto-Interp
    Negative Logits
    を使用
    0.49
     эшлә
    0.38
     occupies
    0.37
     produziert
    0.37
     exceeds
    0.37
     Produktion
    0.37
     används
    0.37
     equals
    0.37
     verwenden
    0.37
     performs
    0.36
    POSITIVE LOGITS
     capire
    0.75
     understand
    0.62
     понять
    0.61
     know
    0.58
     узнать
    0.57
     entender
    0.57
     hiểu
    0.55
    了解
    0.53
     ascertain
    0.51
     tahu
    0.50
    Act Density 0.430%

    No Known Activations