    Negative Logits
    Token           Logit
    " occitan"      -0.08
    " Chapters"     -0.08
    (not shown)     -0.08
    " Sandwich"     -0.08
    " favoriser"    -0.08
    " progreso"     -0.07
    " zure"         -0.07
    (not shown)     -0.07
    " sli"          -0.07
    " slurry"       -0.07
    Positive Logits
    Token           Logit
    " DOUBLE"       0.08
    "double"        0.08
    "_double"       0.08
    "-double"       0.08
    "設備"          0.08
    (not shown)     0.07
    " ممت"          0.07
    "Double"        0.07
    "printed"       0.07
    " میدان"        0.07
    Activation Density: 0.002%

    No Known Activations