INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Prü
    0.42
     абстра
    0.40
     punctuation
    0.37
     abstract
    0.37
     Zariski
    0.36
    ksyon
    0.36
     Acne
    0.36
     Chardonnay
    0.36
     nonparametric
    0.36
     வணக்கம்
    0.36
    POSITIVE LOGITS
     airplane
    1.21
     автомоби
    1.13
     automobile
    1.09
    汽车
    1.08
     자동차
    1.06
     vliegtuig
    1.02
     aviation
    1.01
     авиа
    1.01
     trucking
    1.01
     aeroplane
    1.01
    Act Density 0.026%

    No Known Activations