INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
    akit
    -0.14
     cruiser
    -0.14
    774
    -0.14
     soft
    -0.14
     Cruiser
    -0.14
     shit
    -0.14
     passive
    -0.13
    esel
    -0.13
    fuck
    -0.13
    /loose
    -0.13
    POSITIVE LOGITS
     CO
    0.19
     IPCC
    0.19
     USA
    0.19
     Carbon
    0.18
     carbon
    0.17
    carbon
    0.16
     Gore
    0.16
     fossil
    0.16
    CO
    0.16
    -job
    0.16
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.