INDEX
    Explanations
    NEGATIVE LOGITS
    Fraction    -0.07
     nunca      -0.07
     ulcer      -0.06
     Nah        -0.06
                -0.06
     Fist       -0.06
    .Mutable    -0.06
    _xor        -0.06
     Bills      -0.06
     HIT        -0.06
    POSITIVE LOGITS
     standpoint                         0.08
    ":-                                 0.08
    virt                                0.07
     =================================  0.07
     Feinstein                          0.07
     viewpoint                          0.07
                                        0.06
     viewpoints                         0.06
     POV                                0.06
     (↵↵                                0.06
    Act Density 0.010%

    No Known Activations