INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    enumi
    -0.56
     also
    -0.56
     they
    -0.52
     Référence
    -0.50
    ::::::::
    -0.49
     actually
    -0.48
     effectively
    -0.47
     doesn
    -0.46
     say
    -0.45
     won
    -0.45
    POSITIVE LOGITS
    ReusableCell
    0.69
    NameInMap
    0.65
     varandra
    0.60
     himo
    0.58
    Према
    0.57
     ostavi
    0.57
     utafitiHapana
    0.55
    Попис
    0.54
    Билгалдахарш
    0.54
    Kjelder
    0.53
    Act Density 0.002%

    No Known Activations