INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ocrat
    -0.08
     Gift
    -0.08
    Gift
    -0.08
    isor
    -0.08
    Observable
    -0.08
    รีย
    -0.07
    ocratic
    -0.07
    Bras
    -0.07
    BR
    -0.07
    -0.07
    POSITIVE LOGITS
     montagne
    0.09
     Mountain
    0.08
    _angle
    0.08
     montaña
    0.08
     angle
    0.08
     mountain
    0.08
     hens
    0.08
     الطرف
    0.08
     diện
    0.07
     Farbe
    0.07
    Act Density 0.002%

    No Known Activations