INDEX
    Explanations

    numbers and comparisons

    New Auto-Interp
    Negative Logits
     Bigger
    0.38
     maggior
    0.38
    larger
    0.37
    ترین
    0.37
    akrishna
    0.37
    zijde
    0.36
    ård
    0.35
    Larg
    0.34
    চ্ছেদ
    0.34
    ärast
    0.34
    POSITIVE LOGITS
     smaller
    1.25
    smaller
    1.11
     мень
    1.09
     Smaller
    1.08
    Smaller
    1.08
     shorter
    1.07
     lesser
    1.07
     kleinere
    0.93
     mindre
    0.86
     kleiner
    0.84
    Act Density 0.074%

    No Known Activations