INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    CppMethod
    -0.46
     Mors
    -0.44
     carreira
    -0.40
     Weid
    -0.40
    /**
    -0.38
     Soria
    -0.38
     Sicherung
    -0.38
     Morin
    -0.38
     Holman
    -0.37
     Dietz
    -0.37
    POSITIVE LOGITS
     Apple
    2.16
    Apple
    2.03
     APPLE
    1.60
    APPLE
    1.48
    apple
    1.45
     apple
    1.45
     Apples
    1.23
    苹果
    1.20
     苹果
    1.10
    Apples
    1.05
    Act Density 0.005%

    No Known Activations