INDEX
    Explanations

    descriptive and evaluative terms

    New Auto-Interp
    Negative Logits
     }{}_{\
    0.64
     Física
    0.62
    ޏ
    0.62
     água
    0.62
     tento
    0.62
     Kamera
    0.61
     ඔබට
    0.60
     Roofing
    0.59
    وضح
    0.59
    0.58
    POSITIVE LOGITS
    0.89
    0.80
    0.78
    0.70
    0.68
    0.65
    0.65
    0.64
    0.63
    0.63
    Act Density 0.011%

    No Known Activations