INDEX
    Explanations

    scientific vs. fictional contrasts

    New Auto-Interp
    Negative Logits
    whitelist
    0.47
     хрони
    0.47
     tunes
    0.46
    chronic
    0.46
    Squad
    0.45
    seo
    0.45
     shouts
    0.42
    Dig
    0.42
    hang
    0.41
     школы
    0.41
    POSITIVE LOGITS
     seekBar
    0.45
     TRANSPORT
    0.43
    一个
    0.43
     chalkboard
    0.43
    ہوں
    0.42
    ۔
    0.42
     salt
    0.41
     fluorine
    0.41
    ),
    0.40
     হইয়৷
    0.40
    Act Density 0.001%

    No Known Activations