INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     zonnepanelen
    -0.09
     grada
    -0.09
     CENT
    -0.08
     gde
    -0.08
     ma
    -0.08
     rada
    -0.08
    -0.08
     scars
    -0.08
     ciel
    -0.08
     שכ
    -0.08
    POSITIVE LOGITS
    foobar
    0.08
     wholesale
    0.08
    Collect
    0.08
    Snack
    0.08
    Trivia
    0.07
     dispense
    0.07
    =N
    0.07
     Barre
    0.07
    ]]]
    0.07
    =a
    0.07
    Act Density 0.033%

    No Known Activations